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STREPTOCOCCUS PNEUMONIAE PROTONS AND NUCLEIC ACID MOLECULES 

The present invention relates to proteins derived from Streptococcus pneumoniae, 
nucleic acid molecules encoding such proteins, the use of the nucleic acid and/or 
proteins as antigens/immunogens and in detection/diagnosis, as well as methods for 
5 screening the proteins/nucleic acid sequences as potential anti-microbial targets. 

Streptococcus pneumoniae, commonly referred to as the pneumococcus, is an 
important pathogenic organism. The continuing significance of Streptoccocus 
pneumoniae infections in relation to human disease in developing and developed 

10 countries has been authoritatively reviewed (Fiber, G.R., Science, 265: 1385-1387 
(1994)). That indicates that on a global scale this organism is believed to be the 
most common bacterial cause of acute respiratory infections, and is estimated to 
result in 1 million childhood deaths each year, mostly in developing countries 
(Stansfield, S.K., Pediatr. Infect. Dis. f 6: 622 (1987)). In the USA it has been 

15 suggested (Breiman et al, Arch. Intern. Med., 150: 1401 (1990)) that the 
pneumococcus is still the most common cause of bacterial pneumonia, and that 
disease rates are particularly high in young children, in the elderly, and in patients 
with j predisposing conditions such as asplenia, heart, lung and kidney disease, 
diabetes, alcoholism, or with immunosupressive disorders, especially AIDS. These 

20 groups are at higher risk of pneumococcal septicaemia and hence meningitis and 
therefore have a greater risk of dying from pneumococcal infection. The 
pneumococcus is also the leading cause of otitis media and sinusitis, which remain 
prevalent infections in children in developed countries, and which incur substantial 
costs. 

25 

The need for effective preventative strategies against pneumococcal infection is 
highlighted by the recent emergence of penicillin-resistant pneumococci. It has been 
reported that 6.6% of pneumoccal isolates in 13 US hospitals in 12 states were found 
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to be resistant to penicillin and some isolates were also resistant to other antibiotics 
including third generation cyclosporins (Schappert, S.M., Vital and Health Statistics 
of the Centres for Disease Control/National Centre for Health Statistics, 214:1 
(1992)). The rates of penicillin resistance can be higher (up to 20%) in some 
hospitals (Breiman et a/, J. Am. Med. Assoc., 271: 1831 (1994)). Since the 
development of penicillin resistance among pneumococci is both recent and sudden, 
coining after decades during which penicillin remained an effective treatment, these 
findings are regarded as alarming. 

For the reasons given above, there are therefore compelling grounds for considering 
improvements in the means of preventing, controlling, diagnosing or treating 
pneumococcal diseases. 

Various approaches have been taken in order to provide vaccines for the prevention 
of pneumococcal infections. Difficulties arise for instance in view of the variety of 
serotypes (at least 90) based on the structure of the polysaccharide capsule 
surrounding the organism. Vaccines against individual serotypes are not effective 
against other serotypes and this means that vaccines must include polysaccharide 
antigens from a whole range of serotypes in order to be effective in a majority of 
cases. An additional problem arises because it ahs been found that the capsular 
polysaccharides (each of which determines the serotype and is the major protective 
antigen) when purified and used as a vaccine do not reliably induce protective 
antibody responses in children under two years of age, the age group which suffers 
the highest incidence of invasive pneumococcal infection and meningitis. 

A modification of the approach using capsule antigens relies on conjugating the 
polysaccharide to a protein in order to derive an enhanced immune response, 
particularly by giving the response T-cell dependent character. This approach has 
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been used in the development of a vaccine against Haemophilus influenzae. There 
are issues of cost concerning both the multi-polysaccharide vaccines and those based 
on conjugates. 

A third approach is to look for other antigenic components which offer the potential 
to be vaccine candidates. In the present application we provide a group of proteins 
antigens which are secreted/exported proteins. 

Thus, in a first aspect the present invention provides a Streptococcus pneumoniae 
protein or polypeptide having a sequence selected from those shown in table 2 
herein. 

A protein or polypeptide of the present invention may be provided in substantially pure 
form. For example, it may be provided in a form which is substantially free of other 
proteins. 

In a preferred embodiment, a protein or polypeptide having an amino acid sequence as 
shown in Table 3 is provided. 

The invention encompasses any protein coded for by a nucleic acid sequence as shown 
in Table 1 herein. 

As discussed herein, the proteins and polypeptides of the invention are useful as 
antigenic material. Such material can be u antigenic" and/or M immunogenic" . 
Generally, "antigenic" is taken to mean that the protein or polypeptide is capable of 
being used to raise antibodies or indeed is capable of inducing an antibody response in 
a subject. u Immunogenic" is taken to mean that the protein or polypeptide is capable of 
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eliciting a protective immune response in a subject. Thus, in the latter case, the protein 
or polypeptide may be capable of not only generating an antibody response and in 
addition non-antibody based immune responses. 

5 



The skilled person will appreciate that homologues or derivatives of the proteins or 
10 polypeptides of the invention will also find use in the context of the present invention, 
ie as antigenic/immunogenic material. Thus, for instance proteins or polypeptides 
which include one or more additions, deletions, substitutions or the like are 
encompassed by the present invention. In addition, it may be possible to replace one 
amino acid with another of similar "type". For instance replacing one hydrophobic 
15 amino acid with another. One can use a program such as the CLUSTAL program to 
compare amino acid sequences. This program compares amino acid sequences and 
finds the optimal alignment by inserting spaces in either sequence as appropriate. It is 
possible to calculate amino acid identity or similarity (identity plus conservation of 
amino acid type) for an optimal alignment. A program like BLASTx will align the 
20 longest stretch of similar sequences and assign a value to the fit. It is thus possible to 
obtain a comparison where several regions of similarity are found, each having a 
different score. Both types of analysis are contemplated in the present invention. 

In the case of homologues and derivatives, the degree of identity with a protein or 
25 polypeptide as described herein is less important than that the homologue or derivative 
should retain its antigenicity or immunogenicity to streptoccocus pneumoniae. 
However, suitably, homologues or derivatives having at least 60% similarity (as 
discussed above) with the proteins or polypeptides described herein are provided. 
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Preferably, homologues or derivatives having at least 70% similarity, more preferably 
at least 80% similarity are provided. Most preferably, homologues or derivatives 
having at least 90% or even 95% similarity are provided. 

In an alternative approach, the homologues or derivatives could be fusion proteins, 
incorporating moieties which render purification easier, for example by effectively 
tagging the desired protein or polypeptide. It may be necessary to remove the "tag" or 
it may be the case that the fusion protein itself retains sufficient antigenicity to be 
useful. 

In an additional aspect of the invention there are provided antigenic fragments of the 
proteins or polypeptides of the invention, or of homologues or derivatives thereof. 

For fragments of the proteins or polypeptides described herein, or of homologues or 
derivatives thereof, the situation is slightly different. It is well known that is possible to 
screen an antigenic protein or polypeptide to identify epitopic regions, ie those regions 
which are responsible for the protein or polypeptide's antigenicity or immunogenicity. 
Methods for carrying out such screening are well known in the art. Thus, the fragments 
of the present invention should include one or more such epitopic regions or be 
sufficiently similar to such regions to retain their antigenic/immunogenic properties. 
Thus, for fragments according to the present invention the degree of identity is perhaps 
irrelevant, since they may be 100% identical to a particular part of a protein or 
polypeptide, homologue or derivative as described herein. The key issue, once again, is 
that the fragment retains the antigenic/immunogenic properties. 

Thus, what is important for homologues, derivatives and fragments is that they possess 
at least a degree of the antigenicity/immunogenicity of the protein or polypeptide from 
which they are derived. 
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Gene cloning techniques may be used to provide a protein of the invention in 
substantially pure form. These techniques are disclosed, for example, in J. Sambrook 
et al Molecular Cloning 2nd Edition, Cold Spring Harbor Laboratory Press (1989). 
Thus, in a fourth aspect, the present invention provides a nucleic acid molecule 
comprising or consisting of a sequence which is: 

(i) any of the DNA sequences set out in Table 1 or their RNA equivalents; 

(ii) a sequence which is complementary to any of the sequences of (i); 

(iii) a sequence which codes for the same protein or polypeptide, as those 
sequences of (i) or (ii); 

(iv) a sequence which is has substantial identity with any of those of (i), (ii) 
and (iii); 

(v) a sequence which codes for a homologue, derivative or fragment of a 
protein as defined in Table 1. 

In a fifth aspect the present invention provides a nucleic acid molecule comprising or 
consisting of a sequence which is: 

(i) any of the DNA sequences set out in Table 4 or their RNA equivalents; 



(ii) a sequence which is complementary to any of the sequences of (i); 
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(iii) a sequence which codes for the same protein or polypeptide, as* those 
sequences of (i) or (ii); 

(iv) a sequence which is has substantial identity with any of those of (i), (ii) 
5 and (iii); 

(v) a sequence which codes for a homologue, derivative or fragment of a protein 
as defined in Table 4. 

10 The nucleic acid molecules of the invention may include a plurality of such sequences, 
and/or fragments. The skilled person will appreciate that the present invention can 
include novel variants of those particular novel nucleic acid molecules which are 
exemplified herein.„..Such variants are encompassed by the present invention. These 
may occur in nature, for example because of strain variation. For example, additions, 

15 substitutions and/or deletions are included. In addition, and particularly when utilising 
microbial expression systems, one may wish to engineer the nucleic acid sequence by 
making use of known preferred codon usage in the particular organism being used for 
expression. Thus, synthetic or non-naturaily occurring variants are also included within 
the scope of the invention. 

20 

The term "RNA equivalent" when used above indicates that a given RNA molecule has 
a sequence which is complementary to that of a given DNA molecule (allowing for the 
fact that in RNA "U" replaces "T" in the genetic code). 

25 When comparing nucleic acid sequences for the purposes of determining the degree of 
homology or identity one can use programs such as BESTFIT and GAP (both from the 
Wisconsin Genetics Computer Group (GCG) software package) BESTFIT, for 
example, compares two sequences and produces an optimal alignment of the most 
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similar segments. GAP enables sequences to be aligned along their whole length and 
finds the optimal alignment by inserting spaces in either sequence as appropriate. 
Suitably, in the context of the present invention compare when discussing identity of 
nucleic acid sequences, the comparison is made by alignment of the sequences along 
their whole length. 

Preferably, sequences which have substantial identity have at least 50% sequence 
identity, desirably at least 75% sequence identity and more desirably at least 90 or at 
least 95% sequence identity with said sequences. In some cases the sequence identity 
may be 99% or above. 

Desirably, the term "substantial identity" indicates that said sequence has a greater 
degree of identity with any of the sequences described herein than with prior art nucleic 
acid sequences. 

It should however be noted that where a nucleic acid sequence of the present invention 
codes for at least part of a novel gene product the present invention includes within its 
scope all possible sequence coding for the gene product or for a novel part thereof. 

The nucleic acid molecule may be in isolated or recombinant form. It may be 
incorporated into a vector and the vector may be incorporated into a host. Such vectors 
and suitable hosts form yet further aspects of the present invention. 

Therefore, for example, by using probes based upon the nucleic acid sequences 
provided herein, genes in Streptococcus pneumoniae can be identified. They can then 
be excised using restriction enzymes and cloned into a vector. The vector can be 
introduced into a suitable host for expression. 
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Nucleic acid molecules of the present invention may be obtained from S.pnewnoniae by 
the use of appropriate probes complementary to part of the sequences of the nucleic 
acid molecules. Restriction enzymes or sonication techniques can be used to obtain 
appropriately sized fragments for probing. 

5 

Alternatively PCR techniques may be used to amplify a desired nucleic acid sequence. 
Thus the sequence data provided herein can be used to design two primers for use in 
PCR so that a desired sequence, including whole genes or fragments thereof, can be 
targeted and then amplified to a high degree. One primer will normally show a high 
10 degree of specificity for a first sequence located on one strand of a DNA molecule, and 
the other primer will normally show a high degree of specificity for a second sequence 
located on the complementary strand of the DNA sequence and being spaced from the 
complementary sequence to the first sequence. 

15 Typically primers will be at least 15-25 nucleotides long. 

As a further alternative chemical synthesis may be used. This may be automated. 
Relatively short sequences may be chemically synthesised and ligated together to 
provide a longer sequence. 

20 

In yet a further aspect the present invention provides an immunogenic/ antigenic 
composition comprising one or more proteins or polypeptides selected from those 
whose sequences are shown in Tables 2-4, or homologues or derivatives thereof, 
and/or fragments of any of these. In preferred embodiments, the 
25 immunogenic/antigenic composition is a vaccine or is for use in a diagnostic assay. 

In the case of vaccines suitable additional excipients, diluents, adjuvants or the like 
may be included. Numerous examples of these are well known in the art. 



WO 00/06737 PCT/GB99/0245I 

10 



It is also possible to utilise the nucleic acid sequences shown in Table 1 in the 
preparation of so-called DNA vaccines. Thus, the invention also provides a vaccine 
composition comprising one or more nucleic acid sequences as defined herein. The 
5 use of such DNA vaccines is described in the art. See for instance, Donnelly et al , 
Ann. Rev. Immunol, 15:617-648 (1997). 

As already discussed herein the proteins or polypeptides described herein, their 
homologues or derivatives, and/or fragments of any of these, can be used in methods 

10 of detecting/diagnosing S.pneumoniae. Such methods can be based on the detection 
of antibodies against such proteins which may be present in a subject. Therefore the 
present invention provides a method for the detection/diagnosis of S.pneumoniae 
which comprises the step of bringing into contact a sample to be tested with at least 
one protein, or horaologue, derivative or fragment thereof, as described herein. 

15 Suitably, the sample is a biological sample, such as a tissue sample or a sample of 
blood or saliva obtained from a subject to be tested. 

In an alternative approach, the proteins described herein, or homologues, derivatives 
and/or fragments thereof, can be used to raise antibodies, which in turn can be used 
20 to detect the antigens, and hence S.pneumoniae. Such antibodies form another aspect 
of the invention. Antibodies within the scope of the present invention may be 
monoclonal or polyclonal. 

Polyclonal antibodies can be raised by stimulating their production in a suitable animal 
25 host (e.g. a mouse, rat, guinea pig, rabbit, sheep, goat or monkey) when a protein as 
described herein, or a horaologue, derivative or fragment thereof, is injected into the 
animal. If desired, an adjuvant may be administered together with the protein. Well- 
known adjuvants include Freund's adjuvant (complete and incomplete) ami aluminium 
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hydroxide. The antibodies can then be purified by virtue of their binding to a protein as 
described herein. 

Monoclonal antibodies can be produced from hybridomas. These can be formed by 
fusing myeloma cells and spleen cells which produce the desired antibody in order to 
form an immortal cell line. Thus the well-known Kohler & Milstein technique (Nature 
256 (1975)) or subsequent variations upon this technique can be used. 

Techniques for producing monoclonal and polyclonal antibodies that bind to a 
particular polypeptide/protein are now well developed in the art. They are discussed in 
standard immunology textbooks, for example in Roitt et al, Immunology second edition 
(1989), Churchill Livingstone, London. 

In addition to whole antibodies, the present invention includes derivatives thereof which 
are capable of binding to proteins etc as described herein. Thus the present invention 
includes antibody fragments and synthetic constructs. Examples of antibody fragments 
and synthetic constructs are given by Dougall et al in Ttbtech 12 372-379 (September 
1994). 

Antibody fragments include, for example, Fab, F(ab*)2 and Fv fragments. Fab 
fragments (These are discussed in Roitt et al [supra] ). Fv fragments can be modified 
to produce a synthetic construct known as a single chain Fv (scFv) molecule. This 
includes a peptide linker covalently joining V b and V, regions, which contributes to the 
stability of the molecule. Other synthetic constructs that can be used include CDR 
peptides. These are synthetic peptides comprising antigen-binding determinants. 
Peptide mimetics may also be used. These molecules are usually conformationally 
restricted organic rings that mimic the structure of a CDR loop and that include 
antigen-interactive side chains. 
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Synthetic constructs include chimaeric molecules. Thus, for example, humanised (or 
primatised) antibodies or derivatives thereof are within the scope of the present 
invention. An example of a humanised antibody is an antibody having human 
framework regions, but rodent hypervariable regions. Ways of producing chimaeric 
antibodies are discussed for example by Morrison et al in PNAS, 81, 6851-6855 (1984) 
and by Takeda et al in Nature. 314, 452-454 (1985). 

Synthetic constructs also include molecules comprising an additional moiety that 
provides the molecule with some desirable property in addition to antigen binding. For 
example the moiety may be a label (e.g. a fluorescent or radioactive label). 
Alternatively, it may be a pharmaceutical^ active agent. 

Antibodies, or derivatives thereof, find use in detection/diagnosis of S. pneumoniae. 
Thus, in another aspect the present invention provides a method for the 
detection/diagnosis of S.pneumoniae which comprises the step of bringing into contact 
a sample to be tested and antibodies capable of binding to one or more proteins 
described herein, or to homologues, derivatives and/or fragments thereof. 

In addition, so-called "Affibodies" may be utilised. These are binding proteins 
selected from combinatorial libraries of an alpha-helical bacterial receptor domain 
(Nord et al , ) Thus, Small protein domains, capable of specific binding to different 
target proteins can be selected using combinatorial approaches. 

It will also be clear that the nucleic acid sequences described herein may be used to 
detect/diagnose S.pneumoniae. Thus, in yet a further aspect, the present invention 
provides a method for the detection/diagnosis of S.pneumoniae which comprises the 
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step of bringing into contact a sample to be tested with at least one nucleic acid 
sequence as described herein. Suitably, the sample is a biological sample, such as a 
tissue sample or a sample of blood or saliva obtained from a subject to be tested. 
Such samples may be pre-treated before being used in the methods of the invention. 
5 Trims, for example, a sample may be treated to extract DNA. Then, DNA probes 
based on the nucleic acid sequences described herein (ie usually fragments of such 
sequences) may be used to detect nucleic acid from S.pneumoniae. 

In additional aspects, the present invention provides: 

10 

(a) a method of vaccinating a subject against S. pneumoniae which comprises the 
step of administering to a subject a protein or polypeptide of the invention, or a 
derivative, homologue or fragment thereof, or an immunogenic composition of the 
invention; 

15 

(b) a method of vaccinating a subject against S, pneumoniae which comprises the 
step of administering to a subject a nucleic acid molecule as defined herein; 

(c) a method for the prophylaxis or treatment of S.pneumoniae infection which 
20 comprises the step of administering to a subject a protein or polypeptide of the 

invention, or a derivative, homologue or fragment thereof, or an immunogenic 
composition of the invention; 

(d) a method for the prophylaxis or treatment of S.pneumoniae infection which 
25 comprises the step of administering to a subject a nucleic acid molecule as defined 

herein; 



(e) 



a kit for use in detecting/diagnosing S.pneumoniae infection comprising one 
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or more proteins or polypeptides of the invention, or homologies, derivatives or 
fragments thereof, or an antigenic composition of the invention; and 

(f) a kit for use in detecting/diagnosing S. pneumoniae infection comprising one 
or more nucleic acid molecules as defined herein. 

Given that we have identified a group of important proteins, such proteins are 
potential targets for anti-microbial therapy. It is necessary, however, to determine 
whether each individual protein is essential for the organism's viability. Thus, the 
present invention also provides a method of determining whether a protein or 
polypeptide as described herein represents a potential anti-microbial target which 
comprises inactivating said protein and determining whether S. pneumoniae is still 
viable, in vitro or in vivo. 

A suitable method for inactivating the protein is to effect selected gene knockouts, ie 
prevent expression of the protein and determine whether this results in a lethal 
change. Suitable methods for carrying out such gene knockouts are described in Li et 
al, P.N.A.S., 94:13251-13256(1997). 

In a final aspect the present invention provides the use of an agent capable of 
antagonising, inhibiting or otherwise interfering with the function or expression of a 
protein or polypeptide of the invention in the manufacture of a medicament for use in 
the treatment or prophylaxis of S.pneumoniae infection. 

The invention will now be described with reference to the following examples, 
which should not be construed as in any way limiting the invention. The examples 
refer to the figures in which: 
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Figure 1: shows the results of various DNA vaccine trials; and 
Figure 2: shows the results of further DNA vaccine trials. 

EXAMPLE 1 

The Genome sequencing of Streptococcus pneumoniae type 4 is in progress at the 

Institute for Genomic Research (TIGR, Rockville, MD, USA). Up to now, the 
whole sequence has not been completed or published. On 21* November 1997, the 
TIGR centre released some DNA sequences as contigs which are not accurate 
reflections of the finished sequence. These contigs can be downloaded from their 
Webster (www@tigr.org) . We downloaded these contigs and created a local database 
using the application GCGToBLAST (Wisconsin Package Version 9.1, Genetics 
Computer Group (GCG), Madison, USA). This database can be searched with the 
FastA and TfastA procedures (using the method of Pearson and Lipman (PNAS USA, 
85:2444-2448 (1988)). 

Using FastA and TfastA procedures, the local pneumococcus database was searched 
for putative leader sequence or anchor sequence features. Relevant sequences were 
used to interrogate for comparative novel sequences. These were: 

(i) already described leader sequences of Streptococcus pneumoniae (from 
proteins NanA, NanB, LytA, PapA, pcpA, PsaA and PspA); 

(ii) the leader sequence of Usp45, a secreted protein from Lactococcus lactis\ 

(iii) new hypothetical leader sequences derived from the searches in (i) and 

(ii); 
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(iv) the anchor motif LPxTG, a feature common to many Gram-positive 
bacteria surface proteins which are anchored by a mechanism involving the 
Sortase complex proteins. 

Provided below is an example of this approach, with reference to the sequences 
derived from the database (see table 1). 
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15 



20 



Usp45 



NanA 



PapA PsaA 



PspA 



PrtP 





A I I 



4121.1 4101.3 4172.4 4106.1 4346.1 4194.1 4172.2 4147.2 



10 4191.6 



4170.8 4175.3 



4137.3 4193.1 4122.1 

▼ ▼ 

4187.2 4188.1 4102.1 



M I 

4136.1 4127.1 4191.1 

1 I 



4119.1 



4119.2 



25 



The protein leader sequences of different known exported proteins were used as a 
starting point for a search of the local pneumococcus database described above. The 
hypothetical proteins found with this search were then submitted to a Blast search in 
general databases such as EMBL, Swissprot etc. Proteins remaining unknown in the 
pneumococcus are kept and annotated. Then the search is performed again using the 
new potential protein leader sequence as a probe, using the TfastA procedure. 
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Example 2: DNA vaccine trials 
pcDNA3.1+ as a DNA vaccine vector 
PCDNA3.1 + 

The vector chosen for use as a DNA vaccine vector was pcDNA3.1 (Invitrogen) 
(actually pcDNA3. 1 + , the forward orientation was used in all cases but may be 
referred to as pcDNA3. 1 here on). This vector has been widely and successfully 
employed as a host vector to test vaccine candidate genes to give protection against 
pathogens in the literature (Zhang, et al., Kurar and Splitter, Anderson et ai). The 
vector was designed for high-level stable and non-replicative transient expression in 
mammalian cells. pcDNA3.1 contains the ColEl origin of replication which allows 
convenient high-copy number replication and growth in E. coti. This in turn allows 
rapid and efficient cloning and testing of many genes. The pcDNA3. 1 vector has a 
large number of cloning sites and also contains the gene encoding ampicillin 
resistance to aid in cloning selection and the human cytomegalovirus (CMV) 
immediate-early promoter/enhancer which permits efficient, high-level expression of 
the recombinant protein. The CMV promoter is a strong viral promoter in a wide 
range of cell types including both muscle and immune (antigen presenting) cells. 
This is important for optimal immune response as it remains unknown as to which 
cells types are most important in generating a protective response in vivo. A T7 
promoter upstream of the multiple cloning site affords efficient expression of the 
modified insert of interest and which allows in vitro transcription of a cloned gene in 
the sense orientation. 

Zhang, D., Yang, X., Berry, J. Shen, C, McClarty, G. and Brunham, R.C. (1997) 
"DNA vaccination with the major outer-membrane protein genes induces acquired 
immunity to Chlamydia trachomatis (mouse pneumonitis) infection". Infection and 
Immunity, 176, 1035-40. 

Kurar, E. and Splitter, G.A. (1997) 'Nucleic acid vaccination of Brucella abortus 
ribosomal L7IL12 gene elicits immune response". Vaccine, 15, 1851-57. 

Anderson, R., Gao, X.-M., Papakonstantinopoulou, A., Roberts, M. and Dougan, 
G. (1996) "Immune response in mice following immunisation with DNA encoding 
fragment C of tetanus toxin". Infection and Immunity, 64, 3168-3173. 

Preparation of DNA vaccines 

Oligonucleotide primers were designed for each individual gene of interest derived 
using the LEEP system. Each gene was examined thoroughly, and where possible, 
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primers were designed such that they targeted that portion of the gene thought to 
encode only the mature portion of the gene protein. It was hoped that expressing 
those sequences that encode only the mature portion of a target gene protein, would 
facilitate its correct folding when expressed in mammalian cells. For example, in the 
5 majority of cases primers were designed such that putative N-terminal signal peptide 
sequences would not be included in the final amplification product to be cloned into 
the pcDNA3.1 expression vector. The signal peptide directs the polypeptide 
precursor to the cell membrane via the protein export pathway where it is normally 
cleaved off by signal peptidase I (or signal peptidase II if a lipoprotein). Hence the 
10 signal peptide does not make up any pan of the mature protein whether it be 

displayed on the surface of the bacteria surface or secreted. Where a N-terminal 
leader peptide sequence was not immediately obvious, primers were designed to 
target the whole of the gene sequence for cloning and ultimately, expression in 
pcDNA3.1. 

15 

Having said that, however, other additional features of proteins may also affect the 
expression and presentation of a soluble protein. DNA sequences encoding such 
features in the genes encoding the proteins of interest were excluded during the 
20 design of oligonucleotides. These features included: 

1. LPXTG cell wall anchoring motifs. 

2. LXXC ipoprotein attachment sites. 

3. Hydrophobic C-terminal domain. 

25 4. Where no N-terminal signal peptide or LXXC was present the start codon was 
excluded. 

5. Where no hydrophobic C-terminal domain or LPXTG motif was present the stop 
codon was removed. 

30 Appropriate PCR primers were designed for each gene of interest and any and all of 
the regions encoding the above features was removed from the gene when designing 
these primers. The primers were designed with the appropriate enzyme restriction 
site followed by a conserved Kozak nucleotide sequence (in all cases) GCCACC was 
used. The Kozak sequence facilitates the recognition of initiator sequences by 

35 eukaryotic ribosomes) and an ATG start codon upstream of the insert of the gene of 
interest. For example the forward primer using a BamHl site the primer would 
begin GCGGGATCCGCCACCATG followed by a small section of the 5' end of the 
gene of interest. The reverse primer was designed to be compatible with the forward 
primer and with a Notl restriction site at the 5 1 end in all cases (this site is 

40 TTGCGGCCGC). 



PCR primers 



WO 00/06737 PCT/GB99/02451 

20 



The following PCR primers were designed and used to amplify the truncated genes 
of interest. 

5 

ID210 

Forward Primer 5' CGGATCCGCCACCATGTCTTCTAATGAATCTGCCGATG 
3' 

10 Reverse Primer 5' TTGCGGCCGCCTGTTTAGATTGGATATCTGTAAAGACTT 
3' 

4172.5 

15 Forward Primer 5' 

CGCGGATCCGCCACCATGGATTTTCCTTCAAATTTGGAGG 3' 
Reverse Primer 5' TTGCGGCCGCACCGTACTGGCTGCTGACT 3' 

ID211 

20 

Forward Primer 5' 

CGGATCCGCCACCATGAGTGAGATCAAAATTATTAACGC 3' 
Reverse Primer 5' TTGCGGCCGCCGTTCCATGGTTGACTCCT 3' 

25 4197.4 

Forward Primer 5' CGCGGATCCGCCACCATGTGGGACATATTGGTGGAAAC 

3 1 

Reverse Primer 5' TTGCGGCCGCTTCACTTGAGCAAACTGAATCC 3' 

30 

4122.1 

Forward Primer 5' 

CGCGGATCCGCCACCATGTCACAAGAAAAAACAAAAAATGAA 3' 
35 Reverse Primer 5' TTGCGGCCGCATCGACGTAGTCTCCGCC 3' 

4126.7 

Forward Primer 5' 

40 CGCGGATCCGCCACCATGCTGGTTGGAACTTTCTACTATCAAT 3' 
Reverse Primer 5' TTGCGGCCGCAACTTTCGTCCCTTTTTGG 3' 
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4188.11 

Forward Primer 5' CGCGGATCCGCCACCATGGGCAATTCTGGCGGAA 3' 
Reverse Primer 5' TTGCGGCCGCTTGTTTCATAGCTTTTTTGATTGTT 3' 

5 

ID209 

Forward Primer 5' 

CGCGGATCCGCC ACC ATGCTATTGATACGA AATGCAGGG 3 ' 
10 Reverse Primer 5' TTGCGGCCGCAACATAATCTAGTAAATAAGCGTAGCC 3' 

ID215 

Forward Primer 5* CGCGGATCCGCCACCATGACGGCGACGAATTTTC 3' 
15 Reverse Primer 5' TTGCGGCCGCTTAATTCGTTTTTGAACTAGTTGCT 3" 

4170.4 

Forward Primer 5' 

20 CGCGGATCCGCCACCATGGCTGTTTTTCTTCGCTATCATG 3* 

Reverse Primer 5' TTGCGGCCGCTTTCTTCAACAAACCTTGTTCTTG 3' 

4193.1 

25 Forward Primer 5' 

CGCGGATCCGCCACCATGGGTAACCGCTCTTCTCGTAAC 3* 
Reverse Primer 5' TTGCGGCCGCGCTTCCATCAAGGATTTTAGC 3' 

Cloning 

30 

The insert along with the flanking features described above was amplified using PCR 
against a template of genomic DNA isolated from type 4 S. pneumoniae strain 1 1886 
obtained from the National Collection of Type Cultures. The PCR product was cut 
with the appropriate restriction enzymes and cloned in to the multiple cloning site of 

35 pcDNA3.1 using conventional molecular biological techniques. Suitably mapped 
clones of the genes of interested were cultured and the plasmids isolated on a large 
scale (> 1.5 mg) using Plasmid Mega Kits (Qiagen). Successful cloning and 
maintenance of genes was confirmed by restriction mapping and sequencing — 700 
base pairs through the 5' cloning junction of each large scale preparation of each 

40 construct. 



Strain validation 
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A strain of type 4 was used in cloning and challenge methods which is the strain 
from which the S. pneumoniae genome was sequenced. A freeze dried ampoule of a 
homogeneous laboratory strain of type 4 S. pneumoniae strain NCTC 1 1886 was 
obtained from the National Collection of Type Strains. The ampoule was opened and 
the cultured re suspended with 0.5 ml of tryptic soy broth (0.5% glucose, 5% 
blood). The suspension was subcultured into 10 ml tryptic soy broth (0.5% glucose, 
5% blood) and incubated statically overnight at 37°C. This culture was streaked on 
to 5 % blood agar plates to check for contaminants and confirm viability and on to 
blood agar slopes and the rest of the culture was used to make 20% glycerol stocks. 
The slopes were sent to the Public Health Laboratory Service where the type 4 
serotype was confirmed. 

A glycerol stock of NCTC 1 1886 was streaked on a 5% blood agar plate and 
incubated overnight in a C02 gas jar at 37°C. Fresh streaks were made and optochin 
sensitivity was confirmed. 

Pneumococcal challenge 

A standard inoculum of type 4 5. pneumoniae was prepared and frozen down by 
passaging a culture of pneumococcus lx through mice, harvesting from the blood of 
infected animals, and grown up to a predetermined viable count of around 10 9 
cfu/ml in broth before freezing down. The preparation is set out below as per the 
flow chart. 

Streak pneumococcal culture and confirm identity 



Grow over-night culture from 4-5 colonies on plate above 



Animal passage pneumococcal culture 
(i.p. injection of cardiac bleed to harvest) 



Grow over-night culture from animal passaged pneumococcus 
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10 



15 



V 

% 

Grow day culture (to pre-determined optical density) from over-night of animal 
passage and freeze down at -70° C - This is standard minimum 



Thaw one aliquot of standard inoculum to viable count 



Use standard inoculum to determine effective dose (called Virulence Testing) 



20 

All subsequent challenges - use standard inoculum to effective dose 

An aliquot of standard inoculum was diluted SOOx in PBS and used to inoculate the 
mice. 

25 

Mice were lightly anaesthetised using halothane and then a dose of 1 .4 x 10 5 cfu of 
pneumococcus was applied to the nose of each mouse. The uptake was facilitated by 
the normal breathing of the mouse, which was left to recover on its back. 

30 S. pneumoniae vaccin a t^afr 

Vaccine trials in mice were carried out by the administration of DNA to 6 week old 
CBA/ca mice (Harlan, UK). Mice to be vaccinated were divided into groups of six 
and each group was immunised with recombinant pcDNA3. 1 + plasmid DNA 

35 containing a specific target-gene sequence of interest. A total of 100 fig of DNA in 
Dulbecco's PBS (Sigma) was injected intramuscularly into the tibialis anterior 
muscle of both legs (50 pi in each leg). A boost was carried using the same 
procedure 4 weeks later. For comparison, control groups were included in all 
vaccine trials. These control groups were either unvaccinated animals or those 

40 administered with non-recombinant pcDNA3. 1 + DNA (sham vaccinated) only, 

using the same time course described above. 3 weeks after the second immunisation, 
all mice groups were challenged intra-nasally with a lethal dose of S. pneumoniae 
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serotype 4 (strain NCTC 1 1886). The number of bacteria administered was 
monitored by plating serial dilutions of the inoculum on 5% blood agar plates. A 
problem with intranasal immunisations is that in some mice the inoculum bubbles out 
of the nostrils, this has been noted in results table and taken account of in 
5 calculations. A less obvious problem is that a certain amount of the inoculum for 
each mouse may be swallowed. It is assumed that this amount will be the same for 
each mouse and will average out over the course of innoculations. However, the 
sample sizes that have been used are small and this problem may have significant 
effects in some experiments. All mice remaining after the challenge were killed 3 or 

10 4 days after infection. During the infection process, challenged mice were monitored 
for the development of symptoms associated with the onset of 5. pneumoniae 
induced-disease. Typical symptoms in an appropriate order included piloerection, 
an increasingly hunched posture, discharge from eyes, increased lethargy and 
reluctance to move. The latter symptoms usually coincided with the development of 

15 a moribund state at which stage the mice were culled to prevent further suffering. 
These mice were deemed to be very close to death, and the time of culling was used 
to determine a survival time for statistical analysis. Where mice were found dead, 
the survival time was taken as the last time point when the mouse was monitored 
alive. 

20 

Interpretation of Results 

A positive result was taken as any DNA sequence that was cloned and used in 
challenge experiments as described above which gave protection against that 

25 challenge. Protection was taken as those DNA sequences that gave statistically 

significant protection (to a 95% confidence level (p<0.05)) and also those which were 
marginal or close to significant using Mann- Whitney or which show some protective 
features for example there were one or more outlying mice or because the time to the 
first death was prolonged. It is acceptable to allow marginal or non-significant results 

30 to be considered as potential positives when it is considered that the clarity of some of 
the results may be clouded by the problems associated with the administration of 
intranasal infections. 
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Table 1 



4101.1 

ATGGAAGACTTAGTGACCTTAGATTU l'l I U 1 1 1 ATO ACAG AACTAAG ATTG AAGCCAATGCCAACAAGTATAGTT 
TTGTGTGGAAGAAAACGACAGAGAAAT TCTC CGCCAA.ACTTCAAGAACAGATACAG 

TCACTCCCCTTCTGATTAAATATGCCATGTTTGATAAGAAACAAAAGAGAGGGTATAAAGAGTCAGCTAAAAACT 
TAGCGAATTGGCACTATAATGACA AGGA GGATAGCTACACACATCCTGATGGCTGGTATTATC G ITn 'CACCATAC 
CAAATATCAGAAAACACAGACAGACTrrCAACAAGAAATCAAGGTTTACTACGCCGA 
TCAAAAGGGACTGTATATGAACGAj\CGCTATCAAAACTTGAAAGCTAAAGAATCT 
AGGTAG ACAG ATTTTCGCTCAACGCAAG ATTG ATGTGGAACCTGT 

AAGAGATGTAATCTCAGAGGGAAGCGTCAAGTGAGAATTGACATGGGATTCGTACTTATGGCCAATAACC^ 
AAATATAGTAAAATGAAATAA 

4101.3 

ATGGGGAAAGGCCA TTGGAA TCGGAAAAGAGTTTATAGCATTCGTAAGTTTGCICT 

ATTGGGAC'1'I'GIXjCAO \ i 1 1 ATTAGGAGGAAATATAGCTGGAGAATCTX7TAGTTTATGCGGATGAAACACTTATTA 
CTCATACTGCTGAGAAACCTAAAGAGGAAAAAATGATAGTAGAAGAAAAGGCTXjATAAAGCTTTGGAAACT 

aatatagttgaaaggacagaacaaagtgaacctagttcaactgaggctattccatctgagaagaaagaagatg 

gccgtaactccaaaagaggaaaaagtgtctgctaaaccggaagaaaaagctccaaggatagaatcagvagcttc 

aaatcaagaa aaac cgctcaaggaaga tgcta aagctgtaacaaatgaagaagtgaatcaaatgattgaaga^ 

ggaaagtggattttaatcaaaattggtactttaa>ctcaatgcaaattct 

acgtatctacgtggaaaaaattagatttaccgtatgactggagtatcittaacgatttcgatcatg 

acaaaatgaaggtggacag ctcaac ggtggggaagcttggtatcgcaacactttcaaactagatc 

caagaaaaatgttcgccttacttttgatcgcgtctacatggattct^ 

cattatccaaatggttataj\ccagttctcatatgatatcaccaaatacctt 

ttgctgtccatgcagtcaacaaacagccaagtagccg r 1 GGTATTCAGGAAGTGGTATCTATCGTGATCTGACTTT 

acaagtgacagataaggtgcatgttgagaaaaatgggacaactattttaacaccaa^ 

tggcaaggttgaaactcatgtgaccagcaaaatcgtcaatacggacgacaaagaccatgaacttct 

tcaaatcgttgaacgaggtg gtca tgctgtaacaggcttagttcgtacagcgagtcgtaccitaaa^ 

atcaacaagcctag atg cg atttt agaagttg aaag accaaaactctgqact u i ' 1 1 1 aaatg acaaacctgccttg 

t acg aattg attacg cgi 0 m 1 accgtgacggtcaa ttgg1 1 gatgctaagaagga 1 1 1 1g1 1 1uoi 1a ccgttact 

atcactggactccaaatgaaggtttctctttgaatggtgaacgtatta 

ccatggggcgcttggagcagaagaaaactataaagcagaatatcgccgtctgaaacaaatgaaggagatgggag 

TTAACTCCATCCGTACAj^CCCACAACCCTGCTAGTGAGCAAACCTTGCA^ 

TCACGAAGAGGCCrTTGATACGTGGTATGGTGCCAAGAAACaTATGACTATGGALUI l'l U Tl GAAAAAGATGC 

CACTCACCCAGAAGCTCGAAAAGGTGAAAAATGGTCTGAT1TTGACCTACGTACCATGCT 

CAACCCT GCTA TCTTCATGTGGTCAATTGGTAATGAAATAGGTGAAGCTAATGGTGATGCCCACTCrrTAG 

GTTAAACG' IT I lib" I" l AAGGTTATCAAGG ATGTTGATAAGACTCGCTATGTTACCATGGGAGCAGATAAATTCCGTT 

TCGGTAATGGTAGCGGAGGGCATGAGAAAATTXjCTGATGAACTCGATGCTGTTGGATT^ 

ATTAG\AAGCCCTTAGAGCTAAGCATCCAAAATGGTTGATTTATGGATCAGAAACATCTTCAGCTA 

TGGAAGTTACTATCGCCCTGAACGTG AATTG AAACATAGCAATGGACCTGAGCGTAATTA 

TGG AAA TG AT CGTGTGGGTTGGGGGAAAACAGCAACCGCTTCATXKjA CI ' l ' l ' l G ACCGTG ACAACGCTGGCTATGC 

TGGACAGTTTATCTCGACAGGTACGGACTATATTCGTGAACCTACACCATGGC^ 

AAGAGCTCTTACTITGGTATCGTAGATACAGCCGGCATTCCAAAACATGACTTCTATCTCT 

1 1 IL'IUT 1' AAG AAG AAACCG ATGGT ACACXTTCTTCCTC ACTGG AACTGOT 

TAGCTtjACTCAGAAGG TAAG A TTCG^CTTCUTXiCTTATTCGAATGC'r It.' 1'AGTGTAG AATTG' l'l C 1 1 G AATGGAAA 
ATCT CTTGG TCTTAAGACTTTCAATAAAAAACAAACCAGCGATGCGCOCA 
TGAACTTTATCTTGAATGGAAAGTTGCCTATCAACCACK^ 
GAAArTGCTCGAGATAAGATTACGACnKTGGTAAGCCAGCGGCAGTTCGTOT 

GCAGCAGATCG AAAAGACTTGACTTACATCTACTATG AAATTGTTG ACAGCCAGGGG AATG TOO nCCAACTGCT 
AATAATCTGGTTCGCTTCCAATTGCATGGCCAAGGTCAACTGGTCGGTGTAG 

GAACGCTATAAGGCGCAAGCAGATG GrrClUtX j ATTCGTAAAGCATTTAATGGTAAAGO" 1U 1'ltiCCATTGTCAAAT 

CAACTCAACAAGCAGGGAAATrCACCCTGACTGCCCACTCTGATCTCTTGAAATCGAACCAAGTCACT 

TGGTAAGAAAGAAGGACAAGAGAAGACTGTTTTXiGGGACAGAAGTGCCAAAAGTACAG 

CACCTGAAATGCCTACXACIGTTCCO I l ib I'ATACAGTGATGGTAGCCGTGCAGAACGTCCTGTAACCTGGTCTTC 

AGTAGATGTGAGCAAGCCTGGTATTGTAACGGTGAAAGGTATGGCTGACGGACGAGAAGTAGA^ 

AGT OATT GCICI TAAAT CAGAGCrACCAGTrGTGAAACGTATTGCTCCAAAT^ 

TCTG l'l ' IHCTATG' n ' I ' l'G ATTGATGGAAGTGTTG AAG AGTATG AAGTGGACAAGTGGGAG ATTGCCGAAGAAGATA 

AAGCTAAGTTAGCAATTCCAGGTrCTCGTATTCAAGCGACCGGTTATrrAGAAGGTCAACCAAT^ 

TGTGCTAGAAGAAGCCAATCCTGCGGCACCTCCACT^ 

TACTAGTCAAAAACCAATGCAATACCGCACTCTTCCTTATGGAGCrAAGTIXjCCA 

AAATGCAGCTGTTACAO I ICi'l CAAGCAAGCGCAGCAAACGGCATGCGTGCGAGCATCTTTATTCAGCCTAAAGA 

TGGTGGCCCTCTTCAAACCTATGCAATTCAATTCCTTGAAGAAGCGCCAAAAA 

GAAAAAGCTGACAGTCTCAAAGAAGACCAAACTGTCAAATTGTCGGTTCGAGCTCACT 
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(XTTCT ATT ACCAGCTGATAAACTAACCTTCTCTA 

GAGTTGCATAAGCCAGGAGGAGTCACTCTGAACGCTGAATATGAGGGAGCTAAAGACCAAGTTG^ 

CAAGCCAATACTGAGAAGAAGATTGCGCAATCCATCCGTCCTGTAAATGTAGTGACAGATrTGCATCAGGAACCA 

AGTCTTCCAGCAACAGTAACAGTTGAGTATGACAAAGGmCCCTAAAACTCATA^ 

CGAAAGAAAAACTAGACTCCTATGAAACATTTGAAGTACTAGGTAAAGTrGAAGGAATTGACCTTC 

CAAAAGTCTCTGTAGAAGGTATCG I'l i CAGTTGAAGAAGTCAGTGTGACAACTCXAATCGCAGAAGCACCACAAT 

TACCAGAAAGTGTTCGGACATATGATTCAAATGCrrCACCrmCATCAGCTAAGGTTC 

AGAGCAATACGCTAAGGAAGGTGTCTTTACAGTTAATGGTCGCTTAGAAGGTACGCAATTAACA^ 

TGTTCGCGTATCTGCTCAAACTG AGCAAGGTGCAAACA ill t'lXJACCAATGGACCGGTTCAGAATTGCCACTTGCC 

TTTGCTTCAGACTCAAATCCAAGCGACCCAGTTTCAAATGT^ 

CCAATCGTTGGACAAACTGGAATCGTACTAATCCAGAAGCTTCAGTCGG IOI'IC'ICj I I I GGAGATTCAGGTATCTT 

gagcaaacgctccgttgataatctaagtgtcggattccatgaagaccattc 

attgagtattatgttggtaagactgtcccaacacctcctaaaaacccta g it l ' l ' lit 1 ggtaatgagg accatgtct 

ttaatgatrctgccaactggaaaccagttactaatctaaaagcccctgcrcaactca 

actttagctttcataaagttgaaacctatgctgtt 

TATCACACAGGTACAAATCTTTGCGAAACAAGTTCCGGCAGCCAAG 

CAAAGAOTAGCAAACTrCAACCCTGATrnjACAGACTACTACaTGAGT 

ACAGCAAGTGTTAGCAACAATGGTCTCGCrACCGTCGTTCCAAGCGTTCGTGAAGGTGAGCCAG 

CG AAAGCTG AAAATGGCG ACATCTT AGGAG AATACCGTCTGCACTTCACTAAGGATAAG AGCTTAC I'l 1 L I LATA 

AACCAGTTGCTGCGGTTAAACAAGCTCGOTGCTACAA 

TTACTTCACAGGTAAAGACGGCTACGAAACAAAAGACCTGACAGTItjAATGGGAAGAAGTTCCAGCGGA 

G ACAAAAGCAGGTCAATTTACTGTTCG AGGCCGTGTCCTTGGT AGTAAC C 1 lOi ' l GCTGAGATCACTGTACGACTG 

ACAGACAAACTTGGTGAGACTCTTTCAGATAACCCrAACTATGATGAAAACAGTAACCAGGC CI 1 I UC1 1 CAGCA 

ACCAATGATATTGACAAAAACTCTCATGACCGCGTTGACTATCTCAATGACGGAGATCATTCAGAAAATC 

GGACAAACTGGTCACCAACACCATCTrCTAATCCAGAAGTATCAGCGGGTGTC 

TTGTAGAACGGACTGTTACACAAGGAAAAGTTCAG r I'C r r igcag atagtggtacggatgcaccatctaaactcgt 
TTTAGAACGCTATGTCGGTCCAGACrrTTGAAGTGCCAACCT^ 

CCATTCAACAATCCAGAAAATTGGGAAGCTG ITCC !* !' ATCGTGCGG ATAAAG ACATTGCAGCTGGTGATGAAATC 

AACGTAACATTTAAAGCTATCAAAGCCAAAGCTATGAGATGGCGTATGGAGCGTAAAGCAGATAAGAGCGGTCT^ 

GCGATGATTGAGATGACCTTCCTTGCACCAACrrGAATTGCCTCAAGAAAGCA 

GAAAAGAACTTGCTGATTTCGCn^AAAATCGTCAAGACTATCAAATTACCTATA 

CAGTTGAAGAAAACAATCAACrrAGCrrCAACTCrrGGTAGATAGTCKi 

CGTTTCAGAAAGTGGAAAACAAGTCAAGGAATACCGTATCCACTTGACTAAGGAA 

AAACCAGTTTCTG AG AAGACAGTTGCTGCTGTACAAG AAGATCTTCCAAAAATCGAA IT ITS' I' r GAAAAAGATTTG 

GGATACAAGACAGTTGAGAAAAAAGATTCAACACTGTATCTAGGTGAj^ 

GGAAAAGAACGTATCTrTACAGCGATTAATCCTGATGGAAGTAAGGAAGAAAAACTCCGT 

CCGACAGACCGCATCGTCTTXKnTGGAACCAAACCAGTAGCTCAAGAAGCTAAA^ 

GCAGATACAAAACCAATTGATTCAAGTGAAGCTAGTCAAACTAATAAAGCCCAGTTACCAAGT^ 

GCAAGCCAAGCAGCAGTAGCAGCAGGTTTAACTCTTCTAGGTTTGAGTG^ 

AAAGAAGACTAG 



4101.5 

ATGGATGCAATCTTTGACCTAATCGGAAACGTTrTCAATC^^ 

TCATTTTGACAGTATTGGCTTTA Cl T I ' l ' l GGAGTGAAATTCTCCAAAGCGCTTGAAGGTGGTATCAAACTTGCCAT 

CGCTCTTACAGGTATCGGTGCTATCATCGGTATGCTAAACACTGCTTTCTCAGCATC^ 

AACACTX3GTATCCAATTGAGTATTA(XGACGTTGGTTGGGCACCACTTGCT 

CACTATACTTCTTGCTCATCATGTIXjATTGTCAACATAGTGATGCTAGCTATGAAGAAAACAGATACA 

CG AT ATCTTTG ATATCTGGCAC 1 1 0 1 CTATCACAGGTCTCTTG ATTAAATGGTATGCTOATAACAATGGTGTGAGT 

CAAGGGGTTTCACTCTTTATTGCTACAGCAGCTATCGTCCTTGTCGGTGTC 

GAAACCTACATTTGATGACCTTCTTAACGCCCCAAGTTCATCACCAATC 

AACCCAGTTATCATGU ITl'lCGATAAGA 1' 1 1 1 1 U AAAAATTCTTCCCAGGCCTTG ATAAATATGACTTTG ATGCTG 
CTAAATTGAACAAOAAAATCG G 1 1'rLU Xi GGGATCTAAATTCTTCA'I tJUijri 1 CATCCTTGGTATCGTTATCGGTATT 
ATXKKjAACrCCACATCCAATTGCACGTGTTGCAG ATGCAG ATAAATGGCCT l* C ri' G I ' 1 ' ATCAAAGGATGGTTGTCTC 
TTGGTTTCACrcCCGGTGTATCTTTGGAACTm 

TCACAAGGTATTACAAACGTTXjCTACTAAACGTCTTCAACKjACGTAAATTCAATATC 

TCGCTGGTCGTCCT G AAATCTGGGCTTXn'GCCAACGTACTTGCACCAATCATGTrGATTC 1 1 1C 

AAAAGTTGGAAj^TGGTATCTTGCCACrTGCAGGTATCATCGCTATGGGTGTTACT 

CGTGCTAAATTGCTCCGTATGATTATCrTCGGAACACT C I ' lCi IT GCCACTCTTC CITC ' l ' l r CAGGTACACTTATTGC 
ACCATTTtjCAACAGAACTTGCTAAAGGTGTAGGTGCCTTCCCAGAAGGTGTGAG^ 

TCTACTCTTG AAGGACCAATCG AAAAACITC I'l GGTTGG ACAATTGGTAACACTACAACTGGTG ATATCAAAGCAA 

TCCTTGGTGCAGTAGTCTTCCTTGTATTCTATATCGGTATCTT^ 

CGAAGAGTACGCAGCAAAAGCAAAATAA 

4102.1 
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ATOAAGATTATG AAA AAAA AATATTGC ACTTT AGCGATATT ATT L I 1 1 1 0 111 0 n 'CAATAATTCTGTTACTCCTCA 
AGAAATACCTAAAAATOTG ATGGCAAT ATAACTCACACTCAG ACTAGCG AAAU 1 ' 1 1 11 Li U AATCTG ATGAAAA 
ACAGGTTGACTATTCTAATAAAAATCAAGAAGAAGTAGACCAA^ 

ATTArnyrAAC^ 

TATTGTTAACTCTGAAAGTAATAATTTACTAGGCGAAGATAATTTAGATAATAAAAT^ 

CTAGATAATAGAGGAGGAAATATAGAGCATGACAAAGATAACTTAGAATCGTCGATTGTAAGAAAATATGAATGG 
GATATAGATAAAGTTACTGGTGGAGGCGAAAGTTATAAATTA 

TTTTAGATTCAGGAGTCGATTTACAAAATACTGGATTACTGAAAAATCTTTCAAATCACTCAA 

CAATAAAGGATATTTAGGAAAAGAGGAGGGAGAGGAAGGAATAATATCAGATATTCAAGATAGATTAGGTCATC 

GTACGGCTGTTGTAGCTCAAATTGTAGGGGATGACAATATTAATGGAGTAAATCCTCACC 

TAGAATATrTGGTAAGTCGTCAGCTAGTCCAGATTGGATTGTAAAAGCA 

G ATATTA TCAATCTTAGTACTGGACAATATTTAATGATTGATGGAGAATATC 

CATTTTTGAACrrATAAAAAGCKrrATTGATTACGCGAATCAAAAAGGACrr 

ACTCCCTAAATGTATCAAATCAGTCAGATTTATTXjAAACTTATTAGTTCACGC^ 

AGTAGTTGATGTT CCAA GTTATTTCTCATCTACAATTTCGGTCGGAGGCATAGATCGCTTAG 

GTAATTTATCAGATTTTAGCAATAAAGGGGATTCTGATGCAATATATGCGCCTGCAGGCT 

AQAATTAGG ACTTA ATAACTTTATTAATGCAGAAAAATATAAAGAAGATTGGATTTTTTCGG 

ATATACGTATCTTTATGGAAACTCATTTGCtGCTCCTAAAGTTTCTGGTGCG^ 

AATTAAAAGATCAGCCCTATAATTATATtnTTGTAAAAAAATTCTGGAAGAAACATTACCAGTAA 

4106.1 

ATGAAGAAAACATGGAAAO 10 l'ri'l'I'AACGCTTGTAACAGCTCTTGTAGCT Cj l' I G IXiCTl GTGGCCTGTGGTCAAG 

GAACTGCTTCTAAAGACAACAAAGAGGCAGAACTTAAGAAGGTTGACTTTATCCTAGACT 

ACCACACAGGGCTTTATGTTGCCAAGGAAAAAGGTTATTTCAAAGAAGCTGGAOT 

CACCAGAAGAAAGTTCTTCTGACTTGGTTATCAACGGAAAGGCACCATTrGC\GTGTATTTCC 

TAAGAAATTCKjAAAAAC^AGCAGGAATCACrGCCGTTGCAGCTATTGTTG 

TCGTAAATCTGATAATGTAAGCACrrCCAAAAGACTTGGTrGGTAAGAAATATGGGA 

ACTTGCTATGTTGAAAACCTTGGTAGAATCTCAAGGTGGAGACTTTGAGAAGGTTGAAA 

CTCAAACTCAATCACACCGATTGCCAATGGCGTCTTTGATACTGCTTGGATT^^ 

GCTAAATCTCAAGGTGTAGATGCTAACrrCATGTACTTC 

TTATCATCGCAAACAACGACTATCTGAAAGATAACAAAGAAGAAGCTCGCAAAGTCATCCAAGC^ 

GCTACCAATATGCCATGGAACATCCAGAAGAAGCTCCAGATATTCTCATCAAGAATGCA 

AACGTGACTTTGTCATCGAATCTCAAAAATACTTGTCAAAAGAATACG 

TTGACGCAGCTCGCrGGAATGCTTTCTACAAATGGCATAAAGAAAATGGTATCCTTAAAGAAGACTTG 
AAGGCTTCACCAACGAATTTGTGAAATAA 

4106.4 

ATGATAAAAAATCCTAAATTA TTAAC CAAGT L1 11 IT 1 AAGAA O 1 ' I'l ' 1 GCAATTCTAGGTGGTGTTGGTCT AGTCA 

T TCATATA GCTATTTATTTGACCTTTCCTTTTTATTATATTCAACTXjGAG 

AOTGTrrACGGAGTATTTAAAGACTAAGACATCTGATGAAATTCCAAGCTTACT^ 

GAAAGCTATCAAATTATATCGTGATGTTAGATATGTCTGTTAGTACAGCAGA 

TGTT CACGGGGTGGATGTCTA CAA AGAAG CAAAGAATA 1 1 1 lOCrrri GTATCTCCCATATACATnTrGGTrACA 
ATTGCl 1 1 1 ILLTlTOrn I'I'l Ctl ATTTTTATACTAAACGCITGCTCAATC CTCrri ' l J ' r ACATTTCAGAAGTGACT 

agtaaaatgcaag atttggatgacaata i icol 1 1 1 g atg aaagtagg aaag atgaagttggtgaagttgg aaaa 

cagattaatggtat gtatgagca citgttgaaggttatttatgagttggaaagtcgtaat 

tgcaaaatca>aaggtttcctttgtccgcggagcatcacatgagttc 

cctagagaatatgcagcataatattggagattacaaagatcatccaaaatatattgcaaagagtataaataagat 

TGACC^GATGAGCCACTTATTAGAAGAAGTACTGGAGTCTTCTAAATTCCAAGAGTC 

CTTGACTGTTAAGCCAGTITTAGTAGATATTTTATCACGTTATCAAGAATTAGCTCAT^ 

AAAATCAATTGACAGATGCTACCAGGGTCGTCATGAGTCTTAGGGCATTGGATAAGGTTTTG 

CTTAATGCAATTAAATATTCAGATAAAAATGGGCGTGTAATCATATCCGAGC^ 

ATACSTOTGCGCCTCTAAGTGACCAAGAACTAGAACATTTArrrGATATATT 

TAAGGATXjAAAGTTCCGGTTTGGGTCTTTACATTGTGAATAATATTTTAGAAAGCT 

CTCCCTTATGAACACGGTATGGAATTTAAGATTAGCTTGTAG 

4106.6 

ATGTATTTAGGAGATTTG ATGGAGAAAGCCG AGTGTGGTCAATTTTCAATA C 1 ' ITCL ' l i 1 CTATTACAAG AGTCTC 

AGACGACCG TCAA GGCTGTAATGGAAGAAACAGGATTTTCAAAAGCAACCCTAACG\AATATGTCA 

ATGACAAGGCTTTOGATAGTGGCTTAGAGCT GGCTA TTCACTCAGAACATGAAAA 

AGCTACCAAGG GGAGAG ATATTCGGAGC 1 1011 IT 1 GG AG ACTGCIXTTT AAAT ACCAG ATTTTCGTTT ATCTTCTC 

TAC CACC AACAGTTTTTAGCCCATCAGCTGGCTCAAGAATTGGTGATTAGCGAGGCrACGCTTGGTCGTCAC^ 

CTTjGTTTAAATCAGATTTTGTCAGAATTTGATTTATC 

TCACTATTTCTATTrCTGTCTTTTCCGAAAGGTCTGGTCGAGTCAGGAATGGGAAGGT 
AGAAAACAGGAGATTGCCAATTTAGAGGAAATCTGCGCrrGCAAGTTTGTCTGOT 



SUBSTITUTE SHEET (RULE 26) 



WO 00/06737 



32 



PCT/GB99/02451 



CTCTG CXjCTC ACATCAG TCAAC AACGTCTTCGGCTCAATGCT^ 

gggtattttgacaatatcttttatcttcgtttgctgagaaaggttccot 

agcagttgaggatgctgagatgatgatattcttctcttitctcct^ 

agtatattcttggttttggagggcagttggcagatttactgacgcaattgattcaagaaatc 

tattgggggattatacagaggaccatgtcacctatgaactcagtcagctttgtgctcaagtctat 

ctatattttacaggatcgctao\agtaccagttaga,gaa 

gagacagc\gaggagattttt(^tgctctacctgcttrrcaa 

AATG GCTC CAGTTA ATCGAATATATGGC TGAAA ACGGTGGCCAGCATATGCGGATTGGTCTGGATTTGACATCTGG 
TTTTCrrGTCTTTTCAAGGATGGCAGCCATTTrGAAACGGTA 1 i 1 1 ATT ACCATTG AAGCTT 

ATGACCCTAGTCGGCATTATGATTTGCTGGTTACCAATAACCCGATTCATAAGAAGGAA 
TTTAAAAAATGACTTGGATATXKjAGGATTrGGTAGCGATTCGCCAGTTATTATTCACTT 

4106.7 

ATGGAATrTTCAAAGAAAACACGTGAATTGTCAATTAAAAAAATGCACW 
GGTGGAGGAATCACAGGAGCTGGTGTAGCCTTGCAGGCGGCAGCTAGCGGTCTTGAGA 
CAAGACrrrcCAGAAGGAACATCTAGTCGTTCAACAAAATTGGTTCACGGAGGACT^ 
ACGTAGAAGTGGTCTCAGATACGGTTTCTGAACGTG 

ATCCAATGCTCTTACCAGTTTACGATGAAGATGGAGCAACCTTTAGCCTCTTCCO"IV 

GTACGACCnrrrGGCAGGTGTTAGCAACACACCAGCTGCGAACAAGG l I I1G AGCAAGGATCAAGTCTTGGAACG 

CCAGCCAAACTTGAAGAAGGAAGGCTrGGTAGGAGGTGGAGTGTATCTTGACTTCCGTAACAACGATGC 

CGTG ATTG AAAACATCAAACGTG CCAACCAAG ACGGTG CCCTCATTG CCAACCACGTG AAGGCAG AAGG CTTCCT 

CTTTGACGAAAGTGGCAAGATTACAGGTGTTGTAGCrCXrTGATCTCTrGAC^ 

CGTCTGCrrrATrAATACAACAGGTCCrrGGAGTGATAAAGTACGTAATTTGTCTAAT^ 

AAATGCGCCCAACTAAGGGAGTTCAOTGGTAGTAGATTCAAGCAAAATCAAGGTTrCACAGCCAGrrr 

ACACAGGTTTGGGTGACGGTCGTATGGTCTr IUITCTCCCACGTGAAAACAAGACTTACTTTGGTACAACTG ATAC 

AGACTAC\CAGGTGATTTGGAGCATCCAAAAGTAACTCAAGAAGATGTAGATTATCTACTTG 

CCGCTTCCCAGAATCCAACATCACCATTGATGATATCGAAAGCAGCTGGGCAGGTCTTCG 

TCCATTGATTGCAGGGAACAGTGCCTCTGACTATAATGGTGGAAATAACGGTACCATCAGTGATGAAAGCT 

CAACTTGATTGCGACTGTTGAATCTTATCTCTCCAAAGAAAAAACACGTGAAGATGTTC 

CTTGAAAGTAGCACATCTG AGAAACATTTGGATCCATCTGCA O lTI'd CGTGGGTCTAGCTTGGACCGTGATGACA 

ATGGTCTCrTGACTCTTGCTGGTGGTAAAAT CACA GACTACCGTAAGATGGCrGAAGGAGCTATGGAGCGCGT^ 

TTCACATCCTCAAAGCAGAATTTCACCGTAGCmAAATTGATCAATTCTAAA^ 

ATTGAACCCACCAAATGTCGATTC^AAATCGAACCCTrrGCGCAACTTGGAGTA 

GGAAGCTCACTATCTGGCAAATCTTTACGGTTCAAATGCACCGAAAGTCTTTGCACTTC 

GCGCCAGGACTCAGCTTGGCAGATACTTTGTCCCTTCACTATGCAATGCGCAATGAGT^ 

ACTTCCM'CT 1 CGTCGTACCAATCACATGCTCITrATGCGTGATAGCTTGGATAGTATCGTTGAGCCAATTTTGGAT 
GAAATGGGACGATTCTATGACTGGACAGAAGAAGAAAAAGCAACTTACCGTGCTGATGTCGAAGCAGCTCrCGCT 
AACAACGATTTAGCAGAATTAAAAAATTAA 

4106.8 

ATGATGAATGAATTATTTGGAGAATTrCTAGGGACTTTAATCCTGATTCTTCT 

TGGTTCTTCCTAAAACCAAGAGCAATAGCTCAGGTTGGATTGTGATTACTATGGGTTGG 

TGCAGTCTTTGTATCTGGCAAGCTCAGTCCAGCTTATTTAAACCCAGCT 

GGTTTGCCTTGGGCTTCCGTTTTGCCTTATATCTTAGCCCAGTrCGC^ 

GTTGCAATTCAAACCTCACTATGACKSCAGAAGAAAATGCAGGCAATATC 

CATCAAGGATACTGTATCAAACTTGATTAGCGAAATCCTTGGAAC 11 1 lliiTl 10U101 fGACAATCTTTGCTTTGG 
GTCTTTACGACTTTCAGGCAGGTATCGGAACL 1 1 1 UCAGTGGGAACTTTGATTGTCGGTATCGGTCTATCACTAGG 
TGGGACAACAGGTrATGCCrTGAACCCAGCTCGTGACCTTGGACCT 
AACAAGGGAGACGGAGACTGGTCTTACGCTTGGATTCCTGTrGTAGGCCCTGTTATCG 
CTTGTATTCTCACTTTTCTAG 

4106.10 

ATG AAAAGG ACCTGG AGG AACTC ATTCGTG ACAAATCTTAAT ACAC L' 1 ' l'l 1 ATGATTGGCAATATTGAGATTCCCA 

ATCGTACCGTTTTAGCGCCTATGGCTGGCGTGACCAACTCAGCCTTTCGTACTATCGCAAAOT 

ACTCGTTGTAATGGAAATGGTCTCTGACAAGGGAATCCAATACAACAACGAAAAAA 

CG ATGAGGGCG AAAACCCTGTCTCTATCCAAC 1 1 1 'l'l GGTAGCGATGAAGAGAGCCTAGCACGCGCAGCAGAATT 
CATCCAAGAAAACACCAAGACCGATATCGTCGATATCAACATGGGCTGCCCTGTCAACAAAATCGT 
AGCTGGTGCTATGTGGCTCAAGGATCCAGACAAGATTTACTCCATCATOVACAAGGTCCAGTCT 
CCACTTACTGTCAAAATGCGTACCGGCTGGGCGGACCCATCTCn^ 

CAGGTGTTTCTGCCCTCGCCATGCATGGCCGTACCCGTGAACAAATGTATACrGGCCACGCAG 

TTACAAGGTTGCCCAAGCTCTAACCAAGATrCCATTCATCGCCAACGGTGATATCCGTACTGTCCAAGAAGCCAA 

GCAACGCATCGAAGAAGTTGGTGCTGACGCAGTCATGATTGGCCGAGCTGCCATGGGAAATCCTTACCTCTrC^ 

CCAAATCAACCATTACTrTGAAACAGGAGAAATCCTACCTGATTTGACCTTTG 

CGAACACTTGAAACGATrGATTAACCTCAAAGGAGAAAACGTCGCAGTTCGTGAATTCCGCGGTCT 
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CTATCTCCCTGGAACATCTCGCGCTCCCAAACTCCGTGGAGCCATTTCGCAAGCCAGCACCCTGCCAGAGATTGA 

aaccctcttgcaattggagaaggcttaa 



4107.1 

atgacaaagaagaaaattgagcgtatttctgtaatacaccgagaaaagattttatggctcaagtg 

gagataaagaacaaccraagtatagtgtccttcagcgtaaaatgttoatgct^ 

cttatcaaaaatacgcaactatcaagcagataacagatattacggta 

ctgtaaaagaggtttatgtgtacaatcacatgaatgttatcggagcttgt 

acg\gcttatgataagttaaataagtggtttaatatcta7tctgatttgtattttagcgttg 

tgggggtatatcatgagatggtaggtatctag 

4107.2 

ATGAAAAATTCCAACGAGGCTGAGATGAAATTACTTTATACTGATATTCGGA C1 ICITI GACAGAAATTCTAACAA 

GAGAGGCAGAAGAGCTAGTTGCAGCTGGGVAGCG GGTCT TCTACATTGCCCCCAACTCT L ' ri ' lCi ' l ' ri G AAAAGGA 

ACGCGCCGTGCTGG AATACTTGTCCCAGCAGGC 1 IC1 IT n'CGATTACCGTCACGCGCTTTGCTCAAATGGCTCGC 

TATCTGGTCTTG AATG ATTTACCAGCTAAAACTACTCTTG ATG ATATCGGTCTTGGGTTGGC C I ' IT l 'ACAAATGCCT 

TGC CGAA CTCGATCCCAAGGACTTGCO I b I l'l ATGGCGCTATTAAGCAGCATCCTCAATTGATCCAGCAGTTAATT 

GAGCTTTACCATGAG ATGACC AAATCTCAGATGAGTTTTTTGGACTTGGAGAATre 

GCGGATTTACTCTTGATTTTTGAGAAAGTAACAGCCTATCn'AAT 

CCCATTTGATTGAGGCTATTGAGAATGACAAGGTAAGTAGTGATTTTAATCAA^ 

TACTCGTTTTTCTGCTGAGGAAGAGCGC KTrrGT GGACTTACTTCACGGCAAAGCT 

TATGCTACTTAAGAAAGCCTATACCACrrCCTTTTAGCGACK^GCAATCTCTACCAAGC 

ATCTG GCTT CTAAATACCAAACGCCTGCTCAGGACTGTTCTCAAACTCATGAG 

CT C 1 cu n 1 G n U G AGT CTTCTTATGACTTTTCAGAACTCGCTTTGG ATGTCG ATGAG AAAG accgtgaaaattta 
C AAATCTGGTC I ' 1 0 1 l'l GACGCA AAAG GAGGAGTTGGAGCTAGTAGCCCGTAGTATTCGTCAGAAATTACATGAG 
AACrG\GACCTGAGCTACAAGCATTTTCGTATTCrCTTGGGCKjA 
TTTTTCACCAGTATCAGATTCCTTTTTATCTTGCrrAGAAGCGA^ 

0 AGTCTATTTTAGCTrrAAAACGTTA CCGTT rrCGTCAGG AGGATTTOATTAAT C l ' l C 1 ' 1 ' AG AACTGA l ' l ' 1U 1 ATAC 

TGACCTCAGTCAGTCTGATATTGATGCTTTTGAGCAATATATCCGCTATCTTGGTATCAATGGCTTGCCAGC 

AGCAAACCTTCACCAAATCCCACCATGGAAAATTTAATCT 

ACCTCTTT3AAACCCTCTTTGCCAGCCGAAAACAAAAGGCTGAAA 

AG AAGG AGCTGT GACC AAGCAG TTACAA QATTTCACAACCACTTTGGAAGCTGTAGA^ 

AAGTTTGGAAGGCTTTCTGCCAT Ol 1 1 i AC AACAATTTGCG ACTO l'l' l'l' I GCTGGTTCACAGGTTAGTCTGGAAG A 
CTTCCTAGCCTTGCTCCATTCTGGAATGAG l'l' lb I CCCAATACCGTACCATTCCAGC^ACAGTGGACACTGTTCTG 
GTGCAGAGTTACGATTTGATTGCACCATTGACTGCTGACTTTGTCTATGCTATTGGA 
CAAAAATTTCTCAAAACACCAGTCTTCTGACAGATGAAGAAAGGCAAAACCT 
GTTCAATTACTGATTGCCAGCAGTGAAAATCTCAAGAAAAATCGCT 

GTAAG CAGTT GT' 1 C lU'GTCGGCTCCAAGCCTTTTTAACG AAAGTGAAAGTAAGGAATCTCCCTATCTTCAAG 
GATCCAT TTTGG ATTTAGGCGGAGAGAGAAGAGGATGAATCACAAAGGACTGTCTAAGGAGGATATGGGGTCCT^ 
TCACAGTt 1111 GTCTAGTCTGGTTGCCTATG^CCAGCAGGGTGAGATGAGCGATACTCAGCAAGATTTGACTTTT 
GTCAAGGTTCTGTCGCGTCTCATAGGTAAAAAACTAGATCAGCAAGGTCTGGAAAATCCAGCTATCCCAACCAGT 
CCAAGCAGCAAGACCTTAGCCAAGGACACCTTGCAAGCTCT 

CTATCCTG C CAAAC AG G AGTTTTACCTGTCTACGTCGGGTTTG ACAG A G 1 1 11 1 1 ATCGCAATG AATACAGTT ATTTC 
CTAC GCTA COlTlTAGGCTTGC^GGACKaAATTA^ 

GTATCTTTGAACGCGCCTTACAGTTGCCTAATGAAGATTCCTTTGACCAACOT 

cagtcaagaacgcgaatttgaagctatttatcaagaaagtttggaagcccagtttaccaaggaa g rrnuu 1G AT 

CrTTGCACGGACAACTGOACATATTCTCCGACACAATCCAGCCATCGAAACCATCA^ 

GGAAAAGACCAAGCCTTTATTCAATTAGACAATGGACGCAGTGTCmGTACGAGGCAAGCT 

CGTTTGAAAGCTAATGGACKXJATAGGAGTAGTAGACTACAAATCCAGTCTC 

TTAATCGGCTCAATTCTCAGTTACCAACCTATmGCTGCCCTAAAAAGAG 

CATGTACrrGGAAATGCCTX3AACCTGTCCAATCrCTGATGGCGGTAAA 

CAGCAAATCTATGAAATACCAAGGGCTCTTmCKjAAAAAGAAAGCAGTTAm 

CAAGGCT AATC AACTGACAGATGAGGAATTTCAGCTCCTACTGGACTACAATGCCTATCTTTACAAGAAAGCro 

GAGAAOATTTTAGCAGGCCG GTTC GCCATCAATCCTTATACTGAAAATGGCAGAAGCATTC 

CAACATCAGGCTATTACAGGCTTTGAAGCCAATTACCATCTGGGCCAAGCCCGTTTCCT^ 

CTGATGGCAAGCGTCTGGTCGGAGAAAAACTCAAGCAAGCTTGGCTTGAAAAAATAA 

GAGAGGAGTTGAATCGATGA 

4107.3 

ATGAAGCTTATTCCCTTTTTAAGTGAGGAGGAGATTCAAAAACTG 

CAGAACAAAACTGCCGAGCAAATCGAAGCTATCTACACTTCTGCCCAGAAT^ 

TCTXXjAAAGACCTTTGTCATGGCAGAGCGCATTCTGGACCAATTGGCGCGTGGT 
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TCTCAACCTITACCCTCAAGGCTGCAACTGAACTTAAAGAACCnTrAGA 
AAACAGATGATTjTCGACCTCAAACAACACTTGGGTCGCC\ 

tggattctttcacacaaaaattccttggcaaacatggttatctgcttgatattgcacct 

aaccaaa gcgag caacttattctcgaaaacgaagtctttcatgaggtctttgaagcgc^ 

aaagagaccttragtcatttcctgaaaaactttgctgggcgtggcaaggac 

tataaaatctatgacitcctccaatccaccacrraatcctcaaaacrrggcrgagtc 

aga^gctgattttaccagtgaaaaagaaaaactgaccgagcaaatca 

ttttccgttaccatctggataacgatgccaaggagtttgcaaaggctgcctatttagaaaatgt^ 

ggatgaaattggctccctaaatcaggacrrccgatacrrcaggcttatcaggo 

tctaagcagaaaaacggtcgagctctgactaatgccagccgtaaggctgatttcaagcccctggctgatgcctac 

aacgaagagagaaagacccagtttgctaaactaggacaattatcagaccagatagcgat 

tctcgactatcaagaacgttatcatggagacacttggaaactagctaaaaccttccaatctttcatc^ 

gtagaggcttatcgtcagagaaaacgacaggaaaatgccttcgaattcgcix3atatcagccattacaccattgag 

attttagagaatttcccacaagttcgtgagtcttatcaggagcgcttccatgaactcatgctcgatc 

ataccaaccatattcaagaacggatgctggaattcrrrctctaatggccaca^ 

gcaatccatctatcgtttccgtcaggcagacccgcagattttcaatgagaaattccaacgctatgcgcaaaat 

caag aaggcaggctcattatcctcaaggaaaatttccgtagtagttcagaagtgcrgtcagcaaccaatga 

tttgaacgtctcatggaccaagaggtcggcgaaatcaactatgataacaagcacca 

aactgactcccaatccagacaacaaggcagcatttctcctctacgacaaggacgatacaggtgaggaagaagaga 

gtcaaacagaaacgaaactaacaggcgaaatgcgcttagttatcaaggagattctgaaacttcatcaag 

gtgttgcctttaaggaaattgcccttctgacctccagccgcagtcgtaatc 

gtacggaattcctgtcaaaactgacggagagcaaaacaattatctccaatcxcta 

cactlu'rlvix/i'cattcacaatcccctgcaagactacgccttggttc g 

ATGACKjATGAGCTAGCACGTTTGTCCCTTCAGAAAGCAGAGGATAAAGTCCACGAAAATCTCTATC^ 

tcaatgcacaaaaaatggcaagtagtcaaaaaggcttgattcacacagctctagctg 

aaaaactaaagcaattcatggatatcctagcttcttggc 

ttggaagatttacaacgaccgtttttattatgactatctrrggggctttgccgaa 

ctctatgccctagcactxicgtgctgatcaatttgaaaagagcaatttcaaag u 1T1 0 1 L GCGTTTTATTCGTATGA 

TTGACCAACTCTTAGAAGCCCAGCACGATTTGGCAAGCCTGGC 

TGACCATCCACAAGAGTAAAGGGCTGGAGTTTCCTTACGTCTTTATCCTCAATATGGATCAAGATTT^ 

AGACTCTATGTCAG AAGTCATTCTCAGTCGTCAGAATGGTC 1 1 OG IXJ ICAAATATATTGCCAAGATGGAGACAGGG 

GCAGTAGAAGACCACTATCCTAAAACCATCAAACTCrCCATTCCTAGTCntjACCTATAGGCAGA^ 

TTACAGCTAGCAAGCTATTCTX3AGCAGATGCGTTTGCTGTATGTTGCTATGACGCGGGCTGAG 

TTGTCGGCAAGGGTTCTCGTGAAAAGCTGGAATCCAAGGAATACCCAGCAGC^ 

A TACT AGACTG CAAGGA CGGAXTTTCCAAGATTGCCTTrGGGCTATCACT 

ACTTTAGTTATCGTTTTATTGGCG AAGATCAGTTG ACCAG AG AAGCTATCGGAGAGTTGGAAACCAAG AGTCCTCT 

C CAAG ATAGCTCCCAAGCAGACAATCGTCAGTCAGATACCATCAAAGAAGCrCroGAAATGCTGAAGGAGG^ 

AGTTTATAATACTCTTCACCGCGCAGCTATTGAACTTCCTAGTGTTCAAACCCC^ 

GAACCAG TTATGG ATATGGAAGCTGTCGAGATTGCTGGTCAAGGTCAGTCAGTAGGCAAGAAAATCAGCT^ 

TTGCCAGATTTTTCAACCAAAGAAAAGGTAACrGCAGCTGAGATrGGTAG^ 

ATTGACCT(^GCCAGCAACTAACCCTTGCTAGCCTAACAGAAACACrCAAA^ 

AGAG ACAAG ATCAATCTTGA TAAAA TTCTTGLl I 111 I'lGACACAGTACTCGGTCAGGAAATTCTTGCTAATACCG 
ACCATCTTTATCGCGAGC AACCT TTCTCCATGCTCAAACGAGACCAAAAGAGTCAGGAAGACrrTGTTGTCCGTGG 
TATCCTTGATGGCTATCTGCTTTACGAAAACAAAATTGTTCTGTTCGACTAC^ 

AGTCAACTCGTAG ACCGCTATC GTGGTCAGTTAGCTCTATACGAAGAGGCTTTATCACGAGCCrATTCGA'n'GAAA 
ATATTGAAAAATACTTGATTTTACTCGGTAAAGACGAGGTTCAAGTTGTAAAAGTATAA 

4109.1 

ATGGAACTTGCTCGCCATGCTGAAAGCTTGGGAGTAGATGCTATTGCAACGATTCCACCAATT^ 
CAGAATACTCAGTTGCCAAATACTpGAACGATATCAGTTCTGCAGCrCCAAAGVCAGACT 
TTCCTCAATTGGCAGGGGTTGCrrTGACTCCAAGCCTTTACACA rCCTCGTGTTATCGGTGT 
GAAGAACTCTTCTATQCCAGTTCAAGATATCCAAACCTTT 

GGTCCTGATCAGCAGTTCCTAGGAGGACCCCTCATGGGGGCTAGGGCTGGTATCGGTGGTACT^ 

CAGAACTCTTCTTGAAACTCAATCAGTTGATTGCGGATAAGGACCrAGAAACAGCGCGTGAATTGCAGTA 

CAACGCAATCATTGGTAAACTCACTTCTGCTCATCGAAATATCT^ 

GAAGGCrTCAATATTtKjATCTGTTCGTrCACCATTGACACCAGTGACTGAAGAAGATCCT 

GCTGCTGCCTTGATTCGTGAAACCAAGGAGCGCTTCCTCTAA 

4U0.2 

ATGTATAAGACAAAGTG TTTACG AGAGAAGTTAGTATTA 1 1 1 IT AAAAA I I 11 C W CCCAATCCTGATCTACCAAT 
TTGCCAATrATTCTGCCTCTTTT GTTG ATACTGCAATGACAGGTCAATACAACACTATGGACTTGGCT 
ATGGCAACCAGTATCTGGAATCCT I " I CrrrACATTTCTAACAGGGATTGTCTCAGCCTTGGTGCCTATCATTGGTCA 
CCATCTTCGTCG AGGCAAAAA GGAAG AAGTTGCGTCTG A 1 1 ri'r ACCAATTTATTTATTTGGCCTTGGGCCTATCT 
GTGGTCTTGCTGGGGATGGTACTTTTCTTGGC^CCAATAATCTTG 
CAGTAGCGGTTCGCTATCTTTGGTTTTTATCTATCGGGATTATCC^ 
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tggattcgctgcgcttg acca aactgtccatgtacctc atg c l'l'l'l gttactccctctcaat agcgg atttaactat 

ctcttga^ntacgctgccttrggtgttccagaactgggagcggctggtgct 

gggtcttgcttgggatttctgttctggttttatttaaacaggagaagct 

tcca crtaat atggataaaattaaggaaggagttcgtttaggtctgcctattgggggaactgtc^ 

gctatc ntrcagtggttggc ttga ttatggctaagttttcgcccttc 

cttttcaagtcttatgtacgcctttcctatgagtatctcatcggctatggcta y u i tl' g tl' 1 cctatgaagtgggag 

CCAAGC GATTT GA TGATGC GAAAACCTATATTGGTCTAGGAAGATGGACTXjCCCTCA IT I HGCGGCCrrCACCTr 

AACCTTCCTTTACATTTTTAGGGGAAATGTGGCCAGTCTTTATGGT 

CCrrrTTTTAACTTATAGTCTm 

TAAGG ATACAGTT ATTCC I ITlTACL'l 1 liOH'lU GCTTGGTTATTGGGCCGTAGCAATCCCTCTGTACGCTATTTGA 
4UX2 

ATGAGTACTTTAGCAAAAATAGAAGCGCTCTTGTTTC 

GAACTCCTCrCTCTGCCACCGACAGCCATCCAGCAAAGTTTAGGAAAATTAGCCC^ 
GATTCCACTITTGGCITIXjATTGAGACAAGTGGTGCTTATAGA 

AGGAATACTCTAAGGCGCCTATCAACCAGAGCTTCrrCTCGGGCTGCCCrrGAGACCTTGTCCA 

ACAG CCGATTACGCGGATAGAAATTGATGCCATCCGTGGAGTTAACTCGAGTGGAGCCTTGGCAAAGTTC 

TTTTGACCTXjATAAAGGAAGACGGGAAAAAGGAAGTATTGGGGCGCCCCAACCTCTATGTC 

CCTAGATTACATGGGGATAAACCATTTAGAAGAATTACCAGTGATTGATGAGCTTGAGATTCAAGCCC^ 

CCAATTATTTGGTGAAAGGATAGAAGAAGATGAGAATCAATAA 

4113.1 

ATGG AT ACG ATG ATTAGT AG A 1 " I " I" I " 1 ' I'CGCCATTT ATTTG AAG CCTTAAAAAGTTTG AAACG AAATGGTTGG ATG A 
CAGTAGCTXjCTGTCAGTTCAGTCATGATTACTTTGACCTTGGTGGCAATATTTGCATCT 

aaactagctacagatattgaaaataatgtccgtgtagtagtttatatc 

acaattgaaaaagaagctcaaactttttacaaataatgactac^ 

acggttaaaagtgttaccttttca^gtaaagaagaacaatatgaaaaattaacc 

aaaatcmgaaggagatgccaatccnrrctatgatgcctatattgtagagg 

actatagccgaagatgctaaavvaaattgaaggtgtctctgaggtix^ag 

ttcaagttagcttcatttatccgtgtttgggcactagggattgctg c ' i ' l ' l 01 ' i 'aatttttatcgca o i ' ll 1u 1g at 
ttcaaataccattcgtattaccattatrtcccgcact 

AGTTAT ATCCGTGG ACCCT l'l C I ' I XjTTAG AAGG AGCCTTT ATCGGTTT ATTGGG AGCTATCGCACCATCTtj l'l'l'l tiGT 
CTTTATTGTTTATCAAAriO I'll ACCAATCTGTCAACAAATCGTTGGTAGGCCAAAATCTATCCATG ATTAGTCCA 
GATTTATTTACrrCCGTTG ATG ATTGCCCTACTATTTGTGATTGGGO 1 1 1 1 CATTGGTTCATTGGG ATCAGGAATATC 
CATGCGCCGATTCTTGAAGATTTAG 

4117.1 

ATGAAGAAAGTAAGATTTAl'rri'rri'AGCTCTGCTAl l'l'l'l Cl'l' AGCTAGTCCAGAGGGTGCAATGGCrAGTOATG 
GTACTTGGCAAGGAAAACAGTATCTGAAAG AAG ATGG CAGTCAAGCAGCAAATGAGTGGli IT 1 1 1 GATACTCATT 
ATXXATCTTGGTTCTATATAAAAGCAGATGCTAACTATGCTG 

TTACCrCAAATCTGGTGGCTATATGGCCAAATCAGAATGGGTAGAAGACAAGGGAGCCl 1 1 1 ATT ATCTTG ACCAA 
GATGGAAAGATGAAAAGAAATGCTTGGGTAGGAACITCCTATGTTCGTC 

TGGGTCTATGATTCTCAATACGATGCTTGU IT l' I ATATCAAAGCAGATGG ACAGCACGCAG AG AAAGAATGGCTC 
CAAATTAAAGCKjAAGGACTATTATTTCAAATCCGGTGGTTATCTACTGA 

TGAATGCTAGTGGTGCCAAAGTACAGCAAGGTTGG CU 1 1 ' 1 G ACAAACAATACCAATCTTGGTTTTACATCAAAGA 
AAATGGAAACTATGCTGATAAAGAATGGATTTTCGAGAATGGTCACTATTATTATCTAA^ 
GCAGCCAATGAATGGATTTGGGATAAGGAATCTTGGTTTTATCTCAAATTTC 
TGGGTCTACGATTCTCATAGTCAAGCTTGGTACTACrrCAAATCCGGTGG^ 

GGGATAAGGAATLTIUj 1 1 1 1 ATCTCAAATCTGATGGGAAAATAGCTGAAAAAGAATGGGTCTACGATTCTCATA 

gtcaagcttggtactacttcaaatccgg 1 go 1" i'acatg acagccaatgaatggatttggg ataaggaatcttggtt 
ttacctcaaatctgatgggaaaatagctgaaaaagaatgggtctacgattctc^ 
aaatctggtggctacatggcgaaaaatgagacagtagatggttatgagcttgg 
ggaaaaactacaaatgaaaatcctgcttactatcaagtagtgcctgttacagccaa 
aaaagcttrcctatatatcgcaaggtagtgtcgtatggctagataaggatagaaaa^gtgattj 
ctattactatttctg om 1 u 1 caggctatatgaaaacagaag atttacaagcgctagatgctagtaagg actttat 
cccttattatgagagtgatggccaccg l'l'l'l' i' atcact atgtggctcag aatgctagtatcccagtagcttctcat 
li 1 1 l'l uatatggaagtaggcaagaaatattattcggcagatggcctgcattttgatgo 1 1 1 i aagcttgagaatc 

CCTTCCTTTTCAAAGATTTAACAGAGGCrACAAACTACAGTGCTGAAGAAT^ 
CATTAACAATAGCCTTTTGGAGAACAAGGGCGCTACTrTTAAGGAAGCCGA^ 

tatctccttccccatacrrgccctagaaagtaacrcksggaagaagtaaaato 

gcattacagcct atgatacg accccttacc it i' c l o ctaag acatttgatc atgtgg ataagggaattttaggtgc 

aaccaagtggattaaggaaaattatatcgataggggaagaactttccttggaaacaaggc^ 

ggaatatgcrrcagacccttattgch3gcgaaaaaattgcragtgtgatgatgaaaatcaatgaga^ 

CAAAGATTAG 
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4119.2 

atgaaaaaa ctatt acaaaaa tattgg gca tggg c1 1 i i g tggtcatccccct lti u 1 ' 1 acaagcaa riti ' d tct 
atgtgccga 1 0 n tcaaggagccttttacau ht1 accaactggacagcattgacttataactacaaantg'itgg 
cttaaacaactttaagctcctcttcatcgatccaaaattcatgaatgcga 

gccatggtggttgctgagattgcactcgg gatc ttcattgcgcct g 1 it 1 1 g aattct aaaatcaaaggccaaacct 
tctrccgtgcttggttcttcttcccagctgttttatcrggtttgacagtggctt^ 

TACGGTCTTCCAGCGATTGGAAATGCCCTTCATATTGAATTTTT 

CAATCTTTGCGGCTUTCJ. 1 1 GTCL 1 JCl'I'l OGCAAGGGGTGGCTATGCCCATCATCATCTTCCTAGCTGGTTTGCAA 

TCTATTCCAACTGAGATTACAGAGGCAGCAAGGATTGATGGTGCGACTAGCAAGCAAGTTT^ 

TTGCCITACTTGCrACCAAGTGTCTCTATGGTCTTrATCCTAGCCCTAAAAGGTGGGCTGACT 

CTTTGCCATG ACCGGTGGTGGTCCAAACAATGCCACAACCTCACTTGGGCT C l ' l OG ' 1 1 ' 1 'ATAACT ATGCCTTTAAA 

GAGAGTATCTAAGAAATTTGAAATTTAA 

4119.3 

ATGATGAAACAAGATGAAAGAAAAGCCCTGATTGGCAAATACATTCT^^ 

TGCCGCI'CCM GCTACCCTCTTTAGTTCCTTCAAACCCACTAAGG ATATTGTAGATAA 1 T II ' M ' 1 G GCTTTCCAACC 

AACTTCACATGGGACAACTTTAGCCGTCTCTTAGCTGATGGGATTGGAGGCTArr^ 

CTGTCTTGTCTTTACTTGCAGTAATGATCTTTATCCCTATGGCAGCCTACT 

AAAAGCCTTTACCATCATGTATACCCTCTTAATCCTCGGAATCTTCGTACCTTTCCAAGTCATCATGAT^ 
CGGTTATGATGAGTAAACTCGGTTTGGCTAATACCTTTGGTTTGA 

ACCCTCTTTCTCTATGTTGGCTATATCAAAATCTCGATTCCAGAAAGTCTGGATGAAGCAGCAGAGATCGATGGGG 

CTAATCAATTTACAACCTATTTCCGCATCATCn'CCCAATGATGAAACCGATGCATGCGACAACCATG 

TGCCCTTTGGTTCTGGAATGACTTCATCrrrGCCACTCCTTGTCTTGA^ 

TGTTCCAATACAACTACGCAGGCCAATATTTCAACGACTACGGACCAAGCTTTGCCTCrr 

CAGTATCACCATTCrrCTATCTCTTCnTCCAACGCCATATCAmCAGGAATGAGCAACGGGGCACT 

4U9.4 

ATGAAAAGTATTCTTCAGAAAATGGGGGAGCATCCGATGCTG CnCI'1'1 1'lCl'l AGCTATAGTACTGTTATATCCA 
TTCTTGCACAAAATTGG ATGGGTCTTGTGGCrrCAGTAGG AATGTTTCTATTTACT 1 1 1 ILI 1 1 1 1 GCACTATCAO 
TCGATTTTATCCCATAAATTCTTTCG ATTGATTTTGCAGTTTGT C l ' l 0 1 ' 1 I GGTA GI GIL'1 IblCAG ClXiCTI ' l ' m CC 
AGTTTAGAACATTTCCAAA TTGTG AAG AAATTTAACTATG C ' l ' l ' l ' 1 Cl ' l ' l CACCCAATATGCAGGTGTGGCATCAGA 
AC CGCG CAG AAGTGAC Cl ' lVn 1 AATCCTAATTATTATGGAATTA 1 I ' IOIICjI 1 1 C 1 G 1 A TTATGATTGC TTTCT AT 
CTGT TT ACAACG ACCAAGTTG AATT GGTTG AAAGTATTCTGTGTG ATTGCAGGC 1 11 01 1 AATCTCTTTGGTTTGAA 
CTTTACTCAAAATCG AACTGCCTTTCCTGCTATTATCGCTGG AGCAATTATCTA lVf CT 1 1 1 ACG ACTATTAAAAACT 
GG AAGGCL" l'l T l'GGCTTAGTATTGGGGTCTTCGCGATTGG' 1 T Il> AGTTTCCTdT 11 CTAGTG ATTTGGG AGTTCG A 
ATGGGTACTTTAGACTCTTCTATGG AAGAACGCATTTCTATCTGGG ATGCTCGG ATGGCC 1 ' 1 U I T 1 AAGCAAAATC 
C 1 ' l'l " 1 " 1 GGGGTG AAGGGCCATTG ACCTATATGAACTCTTATCCTCGGATACATGCTCCTTATCATGAACATGCCCA 
CAGTCTTTATATTG ATACGATTCTG AGTTACGGAATTtjTCGGG ACT ATTTTAT^ ' 1 b 1 LI I'lTOT l GCTCCTG 
TTCGCTTGATGATGGATATGAGTCAGGAGTCGGGGAAACGTCCGATTATCGGCCTTTATCTATCTTTCCTTA 
G GTTG CTGTG CACGG AA 1 ' 1 ' l'l ' 1 G ACTTGGCTCTC I ' I'C 11Kj ATTCAGTCAGG CTTT A V 1 1 ' 1 C T 1 O CT AGTTATGTGCA 
GTATTCCATTGGAGCATCGAATGTTGGTATCGGACATGACGGATTAA 

4120.1 

ATCrrCAAAGATGGATGTTCAGAAAATCATTGCACCG ATGATG AAG 1 1 1 U 1 G AATATGCGTGGCATTATAGCTCTAA 
AAGATGGGATGTTAGCAATITTGCCATTGACAGTAGTTGGTAGTTTGTTCTTGATTA 

AGCATTAAATAAGAGCATTGCTACTO 1'1'lTl GCAGCTAATTGGACAGAGCCGTTTATGCAAGTATATTCAGGAACT 
TTTGCTATTATGGGTCTAArriXjn G I" IT l'l CAATTGCCTATTCTTATGCTAAGAATAGCGGAGTAGAGGCTTTACC 
AGCTGGAGTTCrATCTCTATCTGCATTCTTTATTTTGCTAAGATCATCTTATATCCCT 
GGGACGCTATTAGTAAAGTtTGGTTTGGAGGCCAAGGAATTATCGGTGCTATGATTAT^ 

TATTT AT ACL' 1 1 L' I'ri'ATAAAGAGAAAAATTGTTATTAAGATGCCAGAACAAGTTCCACAAGCrATTGCCAAACAG 
TTTG AAGCAATOATTCCAGCATTTGTAA 1 I ' Ul ' l 1 ATCTTCTATGATTGTATATATTTTAGCGAAGTCATTGACTAA 
TGGCGGAACATTCATAGAAATGATTTATTCTGCTATTCAAGTTCCGTTGCAAGG 

GCTATTGGAATTGCATTCTTTATATCA 1 1 1 T 1 b 1 U GTGGTTTGGTGTTCATGGGCAATCGGTAGTAAATGGAGTAGT 
GACAGC ll 'l bCI 1 lUA TCTAATCITGATGCTAATAAAGCTATGTTAGCCrCTGCTAATCTATCATTAGAAAATGGT 
GCACATATTGTTACTCAACAA 1 1 1 1 1 AGATTCA 1 1 11 1 AATTCT ATCAGGTTCAGGG ATT ACGTTTGGTCTTGTAGT 
TGCCATGCTl 1 l'l GCAGCAAAATCAAAACAATACCAAGCCrTAGGAAAAGTTGCAGCrr 1 1 CCAGCAATATTTAAC 
GTAAATGAGCCA G 1101 ATTTGGATTTCCG ATTGTCATGAATCCAGTTAT U ITlb 1 AC Cl ' l ' I' C ATTCTTGTTCCTGT 
ACTTGCAGCTGTGATAGTATATGGAGCTATTGCAACAGGTrrCATGCA 

AGTACACCAGCTATTTTATCAGG A IM' rritjGTGGGTGG ATGGCAAGG AGTTATTACTCAGCTGGTG ATATTAGCG A 
TGTCTACATTGGTTTATTTTCCATTCTTTAAACn-ACAGGATCGTTTAG 

4121.2 

atgaagaaaaaggacttagtagaccaactagtctcagagatcgagacggggaaagtcaggacactgggaatata 

CGGTCATGGAGCTTCAGGTAj^ATCAACCTTTGCACAGGAATTGTACCAAGCTTTAGATTCT 
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CTAGAGACACATCCTTATATCACCTCAGGACGCCATCTGCTAGTACCCAAGGACGCGCCCAATCAAAAGGTGACA 
GCCAGTCTGCCAGTGGCGCATGAACTGGAGAGTrTGCAGAGAGATAT^ 



4122.1 

ATGAAGAAAAGATACCTAGTCTTGACAGCTTTGCTAGCCTTT3ACT 

AATGAAGATGGAGAAACTAAGACAGAACAGACAGCCAAAGCTGATGGAACAGTCGGTAGTAAGTCTCAAGGAC^ 

TGCCCAGAAGAAAGCAGAAGTGGTCAATAAAGGTGATTACTACAGCATTCAAGGGAAATACGATC 

AGCCAACAAACACTATCCATTGTCTAAAGACTATAATCCAGGGGAAAATCCAACAGCG\AGGCAGAGTTGGTCAA 

ACTCATCAAAGCGATGCAAGAGGCAGGTTTCCCTATTAGTGATCATTACAG 1 Uj 1 TV 1 AGAAGTTATG AAACTCAG 

ACCAAGCrCTATCAAGATTATGTCAACCAAGATGGAAAGGCAGCAGCItjACCGTTACTCTGCCCGTCC^ 

AGCGAACACCAGACAGGCTTGGCCTTTGATGTGATTGGGACTGATGGTGATTTGCT 

CAATGGCTCTOGATCATGCAGCTGATTATGGCmGTTGTCCGTTATCTCAA^ 

ATATCGCTGAAGAATGGCACCTGCGTTATGTAGGAAAAGAAGCTAAAGAAATTGCT 

AAGAATACTATGGCTTTGAAGGCGGAGACTACGTCGATTAA 

4125.6 

ATGCGTAAATTCTTAATTATTTTGTTGCT ACCAA CTTTTTT^ 
TCGTCTATACTTCGAAAGAAATTTATTACCTTT^ 
CCCATGGTTTATGGAGAGCrrrCCTGTrrATGCGAATGAAG 
CAAGTTTTCAAATAACCGACTGGCGCTTAAATAAACAAGGAATTCCACT 

AGCT GCGGACAAACGATTTTTATATGATCAATCAGAGGTAACTCCAACAATAAAAAAAGTATGGTTAGAATCT 

CTTTAAACTGT ACAAT AGTCCTTATGATTTAAAAGAAGTGAAATCATCCTTATCAGCTTA 

GACAAGACCATGTTTGTAGAAGGAAGAGAATTTCTACATATTGATCAGGCrGGATGGGTAGCTAAA 

T CTGA AGAAGATAATCGGATGAGTAAAGTrCAAGAAATGTTATCTGAAAAATATCAGAAAGArrCm 

ATGTTAAGCAACTGACTACTGGAAAAGAAGCTGCrrATCAATCAAGATGAAAAGATGTATGCAGCC^ 

AACTCTCTTATCTCTATTATACGCAAGAAAAAATAAATGAGGGTCTTTATCAGTTAG 

ATCTGCAGTCAATGATTTTCCAGGTTCnTATAAACCAGAGG 

AGAATATTCTTTAAAGGATTTAATTACGAAAGTATCAAAAGAATCTGATAATCT 

TAG/VTTTCAAACCAATCTGATGCCACATTCAAATCCAAGAT^ 

AAAAATTGATTTCTTCTAAGATGGCCGGGAAGTTTATGGAAGCTATTTATAATCAAAA 
TTTGACTAAAACAGATTTTGATAGTCAGCGAATTGCCAAAGGTGTTTCTG^ 
CTCATAAAATTGGAGATGCGGATGAATTTAAGCATGATACGGGTGTTGTCT 
ATTTTCACTAAGAATTCTGATTATGATACGATTTCTAAGATAGCCAAGGATGTTTATGAGGTTCT 

4125.7 

ATGAAAAAACAAAATAATGGTTTAATTAAAAATCCTTTTCTAT^ 

CCAGTATTTCTATTCTGGGAATAACTCAGGAGGAAGTCAGCAAATCAACTATACTC 

CGATGGTAATCTAAAAGAATTAACTTACCAACCAAATGGTAGTGTTATCGAAGTTTCTG 

A AAACA AGTAAAGAAGAAACAGGTATTCAGTTrrrCACGCCATCTGTTACTAAGCT 

ATTmCCTGCAGATACTACCGTATCAGAATTGCAAAAACTTGCTACTC 

ATGAAAGTTCAAGTGGTATATGGATTAATCTACTCGTATCCATTGTGCCATTTG 

TCTATGATGGGAAATATGGGAGGAGGCAATGGCCGTAATCCAATGAGTTTTGGACGTAGTAAGGCTAAAGCAGCA 
AATAAAGAAGATATTAAAGTAAGATTTTCAGATGTTGCTGGAGCTGAGGAAGAAAAACAAGAACTAGTTGAA 
GTTGAGTTCTTAAAAGATCCAAAACGATTCAG\AAACTTGGAGCCCGTATTCCAGCAGGT (jl 1U 1 M GGAGGGAC 
CTCCGG GG ACAG GTAAAACTTTGCTTGCTAAGGCAGTCGCTGG AG AAGCAG G 1 G TT C CATTCTTT 
TTCrGACTTTGTAGAAATGTTTGTCGGAGTTGGAGCTAGTCGTGTTC 1 1 1 GAGG ATGCCAAAAAAGCA 

GCACCAGCTATCATCTTTATCGATGAAATTGATGCTGTTGGACGTCAACGTGGAGTCGGTCTCGGCGGAGCT 
ACGAACGTGAACAAACCTTGAACCAACTmGArrcAGATGGATGCTITTTG 

TCGCTGCGACAAACCGTTCAGATGTACTTGACCCTGCCCTTTTGCGTCCAGGA CU Mil G ATAG AAAACTATTGGT 
TGGTCGTCCTGATGTTAAAGGTCGTGAAGCAATCTTGAAAGTTCACGCTAAGAATAAGC 

CTTTAGCAGAAGAT GTTG ATTTG AAATTAGTGGCTCAACAAACTCCAGG L' 1 HUi 1 GGTGCTGATTTAGAGAATGT 
CTTGAATGAAGCAGCTTTAGTTGCTGCTCGTCGCAATAAATCGATAATTGATC 

GATAGAGTTATTGCTGGACCTTCTAAGAAAGATAAGACAGTTTCACAAAAAGAACGAGAATTGGTTGCTT 
GAGGCAGGAC^TACCVTTCTTCGTCTAGTCTTCTCGAATGCrCGCGTTGT 

GCCGTGCAGGCGGATAGATGATTGCACTTCCTAAAGAGGATCAAATGCITCTATCTAAAGAAGATATGAAAGAGC 

AATTGGCTGGCTTAATGGGTGGACGTGTAGCTGAAGAAATTATCTTTAATGTC 

ACTrTCAACAAGCGACACAAATGGCACGTGCAATGGTTACAGAGTACGGTATCAGTGAAA^ 

AATATGAAGGAAACCATGCTATGCTTGGTGCACAGAGTCCTCAAAAATCAATT^ 

TTGATGAAGAarnCGTTCATTATTAAATGAGGCACGAAATAAAGCTGCTGAAATTATTCAOT 

TCACAAGTTAATTGCAGAAGCATTATTGAAATACGAAACATTGGATAGTACACAAATTA 

AGGAAAGATGCCTGAAGCAGTAGAAGAGGAATCTCATGCACTATCCTATGATGAAGTAAAGTCAAAAATGAATGA 
CGAAAAATAA 
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4125.10 

atgagggaaccacatttttta aatca ttttctcaagaag 
tttctggtggattagattccatgtttctatttaaggtattgtctacttatcaaaaagag^ 
ctagctcatgtgaatcataagcaga gaattg aatcagattgggaagaaaaggaattaaggaagttggct 
gca^gcttcctatttatatcagcaatttttcaggagaattttcag 

TTTTTC^GAGGTCATGAAAAAGAC^GGTGCGACAGCTrrA 

cgatttttatgcgcttgattcgaggaactcgcitgcgcratctatcag 

gatagaaatcattcgtc ccttc ttgcattttcagaaaaaagactttccatcaattt^ 

aatcaggaqaatcattatttrcgaaatcgtatt cgaa attctracttaccagaatr 

ttagggatggkatcttaggcattgg caat gaaatrttagattatgatrrggcaatagct 

taatgtggaagatttacagcagttattttcttactctgagtct 

gttttccagatttgaatcrtacaaaagctcagtttgctg 

tcatccgattaaaaatggctatgaattgataaaagagtaccaacagtttcagatttgtaaaatcagtc 

g atg aaaagg aag atg.aacttgtgttacactatcaaaatcaggtagcttatcaaggatattt 1 ' 1 1 if 1 1 1 t ggac 

ttccatt agaag gtgaattaatrcaacaaatacctgtitcacgtgaaacatccatacacat^ 

aggagatgttttgattaaaaatgggcatagaaaaaaactcagacgtttatttattg 

atttgaaaatccctatggaaaagagaaactctgctcttattattgagcaatttggtgaaattct 

aattgcgaccaataatttgagtaaaaaaacgaaaaatgatataatgaacactgtactttatatagaaaaaataga 

TAGGTAA 
4126.1 

ATC v AGCGTTCTTCTC" Hi' I AGTTAG AATG GTTATTTCCATCTTTCTCK3TCTTTCTCATTCTCCTAG CTCTGGTTGGA 

ACTyrCTA CTATC AATG\AGTTCTTGVGCCATTGAGGCCACCATTGAGGGCAACAGCCAAACGACCATCAGCCA 

ACTAGCCACTTTATTCAGTCTTATATCAAAAAACTAGAAACCAC 

TGGCCrATGCTGAGAATCCCAGTCAAG ACAAGGTCGAGGG AATCCG AGA 1 ' I IGITI I ' l GACCATCITGAAGTCAGA 
TAAGGACTTGAAAACTGTTGTGCTCGTGACCAwAATCTGGTCAGGTCATTTCT 

AACTTCCTCTGATATGATGGCTGAGGATTGGTACCAAAAGGCCATTCATG\GGGAGCTATGCCTGTTTTGACT 
GCTCG TAAAT CAGATAGTCAGTGGGTCATTTCTGTCACTCAAGAA L l'[ C j I 'l GATGCAAAGGGAGCCAATCTTGGTG 
TGCTTCGTTTGGATATTTCTTATGAAACTCTGGAAGCCTATCTCAATC^ 
CTTCATTATCAATGAAAACCATGAATTTGTCTACCATCCTCAACACACAGTTTATAGT^ 

GCTATGAAACCCTACATCGATACAGGTCAGGGTTATACTCCTGGTCACAAATCCTACGTCAGTCAAGAGAAGATT 
GCAGGAACTGATTGGACGGTGCTTGGCGTGTCATCATTGGAAAAGTTAGACCAGGTTCGGAGTCAGCT 
ACCTTGCTTGGGGCCACrrCrrCACATCTCTTCTrGTCTGTCTCTGOT 
TCCTTTXjAAGGATTTGAGAGAAACCATGTTGGAAATTGCTTCT 

gcctatcaactgagagaagtaactcgccaatttaatgctatgttggatcagattgatcagtt 

gt agcc aggaagaaacgacccgtcagtaccaacttcaagccctttcgagccagattaatccacatttcctcta^ 

cactttggacaccatcatctggatggctgaatttcatgatagtcagcgagtggtgcaggtgacca^ 

acct attt ccgcttggcgctcaatcaaggcaaggacttga rnxi ' l 'ctcrctgacgaaatcaatcatgtccgccagt 

atctctttatccagaaacaacgctatggagataagctggaatacgaaattaatgaaaatgttgcc^ 

agtcttacccaagctggtcctacaaccccttgtagaaaatgctct 

gggccatattaaac1 l'lclgl'ccagaaacaggattcgggattggtcatccgtattgaggatgatggcgttggcttc 

caagatgctggtgatagtagtcaaactrcaacrcaaaccrrgggggagtrggtct^ 

aaacttcattttggagccaattaccatatgaagattgattctagaccccaaaaagggacgaaagttgaaatatat 

ATAAATAGAATAGAAACTAGCTAA 



4126.7 

ATG AAGCGTTCTTCTC 1 I'l I AGTTAG AATGGTTATTTCCATCTTTCTGGTCTTTCTCATTCTCCT^ 
ACTTTtrTACTATCAATCAAGTTCT^ 

ACTAGCCACTTTATTCAGTCTTATATCAAAAAACTAGAAACCACCTCGACTGGTT^ 

TGGCCTATGCTGAGAATCCCAOTCAAGACAAGGTCGAGGGAATCCGAGA rn CiTl'l'l'l GACCATCTTGAAGTCAGA 
TAAGGACTTG AAAAC UGTGACCAAATCTGGTCAGGTCATTTCTACAGATGACAGTGTGCAGATGAA 
AAC ITCCTC rC ATATGATGGCTGAGG ATTGGTACCAAAAGGCCATTG^TCAGGGAGCTATGCCrGTTTTGACTCCA 
GCTCG TAAAT CAGATAGTCAGTGGGTCATTTCTGTCACTCAAGAA L 'ri G I 'l GATGCAAAGGGAGCCAATCTTGGTG 
TGCTTCGTTTGGATAmcrTATGAAACTCTGGAAGCCTATCTCAATCAACTCCA 
CTTCATTATCAATGAAAACCATGAATTTGTCTACCATCCTCAACACACAGTTTATAGTTCGTCT 
GCTATCAAACCCTACATCGATACACGTCAGGGTTATACTCCTGGTCACAAATCCTACGTCACT 
GCAGGAACTGATTGGACGGTGCTTGGCGTGTCATCATTGGAAAAGTrAGACCAGGTTCGGACT 
A CCTrG CrrGGGGCCAGTGTCACATCTCrrCTTGTCTGTCTCTGCTT 

TCCTTTGAAGGATTTGAGAGAAACCATGTTGGAAA 1 I GCI ' I'CTGGTGCTCAAAATCTTCGTGCCAAGGAAGTTGGT 

GCCTATGAACTGAGAGAAGTAACTCGCCAATTTAATGCTATGTrcGATCAGATTGATCAGTTGATGGTAGCTATTC 

GT AGCC AGGAAGAAACGACCCGTCAGTACCAACTrCAAGCCCTrrCGAGCCAGATTAATCCACATTTCCTCT 

CACTTTGGACACCATCATCTGGATGGCTGAATTTCATGATAGTC^GCGAGTGGTGCAGGTGACCAAGTCCTTGGCA 

ACCT ATTT CCCCTTGGCGCTCAATCAAGGCAAGGACTTGATTTGTCTCTCTC 

ATCTCTrrATCCAGAAACAACGCTATGGAGATAAGCTGGAATACCAAATTAATGAAAATCTTGCCT^ 
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actcttacccaag ctcct cctacaacc^ 

gggccatattaaactttct.gtccagaaacaggattcgggattggtcatcot 

caagatg ctggtg atagtagtcaaagtcaactcaaacgtgggggagttggtcttcaa 

aaacttcarritggagccaattaccatatgaagattgattctagaccccaaaaagcwa 

ataaatagaatagaaactagctaa 

41 27.4 

ATGTTT TTTAA ATTATTAAGAGAAGCTCTTAAAGTCAAGCAGC 
TTTGGTCTTTCCrrATCGGAACTAGCATTACAGTTCCT 
CCTTCTTAAACATGTTGAGCTTtjGTGTCGGGGAAT 
CTATATCACCGCTTCTATTGTTGTCCAACTCTTGCAA^ 

GGGGAAGTAGG TCGAAO AAAATTG AATCAAGCTACTCGTTATATTGCTCrAGTTCTCG L ' 1 ' IT 1 0 1 G CAATCTATCC 
GGATTACAGCTGCrrTTAATACCTTGGCTGGAGCTCAATTG 
GATTGGTATCATCTTAACAGCTtKjTAGTATGATTGTCACTTGGTTGGGTC 
AACGGTGTTTCCATGATTATCTTTGCCGGGATTGTTTCCTCAATTCC^ 

A CiTI U I GAACGTCCCAAGTAGCCGTATCACTTCATCTATCATTTTCGTAATCA l'l'I'l GATTATTACTGTATTGTTG 
ATTATTTACTTTACAACTTATGTTCAACAAGCAGAATACAAAATTCCAAT^ 

CTCCATCTAGCTCTTACCTTCCGTTAAAAGTAAACCCTGCTGGAGTTATCC C ' IXJl 1 ATCTTTGCCAGTTCGATTACT 
GCAGCGCCTGCGGCTATTCTTCAU ITIT IGAGTGCCACAGGTCATG ATTGGGCTTGGGTAAGGGTAGCACAAGAG A 
TGTTGGCAACTACTTCTCCAACTGGTATTGCCATGTATG C li'lG ' 1T G ATTATT L ' 1 C 1 ' I ' l ACATTCTTCTATACGTTTG 
TACAGATTAATCCTGAAAAAGCAGCAGAGACCTACAAAAGAGTGGTGCCT^^ 
AGGTACAGAAGAATATATGTCTAAACrrCTTCGTCGTOT 



4127.5 

ATGCATATTAGACAACTTACTGAmAACCATCGCCATGATTGAGGAGCAAAACITCGATATTA 

GGGATTTCTC! 1 1 IGGACTGTATCGATCCAGATATCAATCGTGCTGCGGAGAAAATCTATCAAAAAATTACGACAA 

AGGCGGCTAATTTAGTAGCTGTTGGTGATCAAATTGCGGCTGAGTTGGGAATTCCTATCGT^ 

GGTGACACCTATTTCTCTGATTGGGGCAGCGACAGATGCGACGGACrACGTGO TT CTGGCAAAAGCGCTTGATAA 

GGCTGCGAAAGAGATTGGTGTGGACnTATTG G r GG 11 I T l CTOCCTTACTACAAAAAGGTTATCAAAAGCGAGAT 

GAGATTCTCATCAATTCCATTCCTCGCGCTTTGGCTGAGACGGATAAGGTCTGCrCCT 

CCAAGTCTGGTATTAATATGACGGCTGTGGCAGATATGGGACGAAra 

TGGGAGTGGCCAAGTTGGTTCTATTCGCTAATGCTGTTGAGGACAATCCATTTATGG^ 

TGGGG AAG CAGA TGTTATCATCAATGTCGGAGTTTCTGGTCCTGGTGTTGTGAAACG' ItiC 1 1 1 GO AAAAAGTTCGT 
GGACAGAGCn'IXjATGTAGTAGCCGAAACAGTTAAGAAAACTGCCTTTAAAATCACT 

GTCAAATGGCCAGTGAGAGACTGGGTGTGGAOM 1 G GTATTGTGGACTTGA G Ml li GCACCAACCCCTGCGGTTCO 
AGACTCTCrn30CACGTCrrCCTTGAGGAAATGGGGCTAGAAACAGTTG 

CCTCTTGAACGACCAAGTTAAAAAGGGTGGAGTGATGGCCTGCAACCAAGTCGGTGGTTTATCTGGTC 

CCTGTTTCTGAGGATGAAGGAATGATTGCTGCAGTGCAAAA 

CGGCTATCTGTTCTGTTGGATTGGATATGATTGCCATCCCAGAAGATACGCCTGCT 

TGCGGATGAAGCAGCAATCGGTGTTATCAACATGAAAACAACAGCTXjITCGTATCATTCCCAAA 

CGATATGATTGAGTTTGGTGGTCTATTAGGAACTGCACCCGTTATGAAGGTTAAT^ 

ATCTCTCGCGGTGGACAAATCCCAGCACCAATTCATA U 1111 AAAAATTAA 

4128.1 

ATGACACAGATTATTGATGGGAAAGCTTTAGCGGCCAAATTGCAGGGGCAGTTGGCTGAAAAGACTC 

AAGGAAGAAACAGGTCTAGTG C ' L ' IXjG l ' 1 ' l GGTAGTGATTTTGGTTGGGGACAATCCAGCCAGCCAAGTCTACGTT 

CGCAACAAGGAGAGGTCAGCCCTTGCGGCTGGTTTCCGTAGCGAAGTAGTACGGGTTC 

GAGGAATTGTTAGACCTGATTGCrAAATACAATCAGGATCCAGCTTGGCATGGGATTTTGGrr 

GAAAACACATTGATGAAGAGGCGGTTCTATTGGCTATTGACCCAGAAAAGGATGTGGATC 

ACATGGGGCGTCTTTGGTCTGGTCATCCAGTCATGATTCCTTCGACACCGGCAOT 

ATATGGGATTGACTTGGAAGCTAAAAATGCAGTCGTCATCGGTCGATCCAATATTGTCGGA 

GC 1 1 CI 1 11 OGCAAAGAATGCAACAGTAACCTTGACTCACTCACGTACTCATAATCTTTCCAAGGTGGCTGCAA^ 

GCAGATATT C l bO MG 1 1 O CAATCGGTCGTCCCAAGTTTGTG ACTGCTG A C IT 1 G I CAAACCACGTGCGGTAOTCA 

TTGACGTTXjGGATGAACCGCGATGAAAATGGTAAGCTCTGTGGGGATGTTGATTATGAGGCGGTTGCC^ 

TAGCCACATTACXjCCAGTCCCTGGAGGTGTCGGTCCTATGACCATTACTATGCTGATGG 

gcacttaggacattggatagaaaataa 

4128.2 

atgtctaaatttaatcgtattcatttggtggtactggattctgtagg 

ttgtcaatccaggggttccagatggagcrtctgacacactgggacacatttcaaaaac^ 

aaacatggctaaaataggtcttggaaatattcctcgtgaaactcctcttaagactgtagcagctgaaagcaatcc 

aactxjgatatgcaacaaaattagaggaagtatctcttggtaaggatactatgactggacacttc 

actcaacattactgagcctttcgatacmctggaacggatrcccagaagaaatcctc 

tcagg acgcaaggttattcgtgaagccaacaaaccttattcagg aacggctgttatctatg a 1" vi iugaccacgtc 
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TCCrn'GGATCAATTGTACCGTATCTGTGAATACGCrCGTTCGATTACCCTTGAGCGTCCTGCCL'I IU fO O TCGCA 
TCATTGCTCGCCCTTATGTAGCrrGAACCAGGTAACTTCACTCGTACGGCAAACCGTCGTG 
ATTTTTCCCAACTGTTTTGGATAAATTGAATGAGGCrGGTATCGATACT 
D TTTAACGGTGCTGGTAT CAAC G\TGACATGGGTCACAACAAGTCAAATAGTCATGGAATTGATACACT 

ACTATGGGAc 1 1 i uAGTTTGAAAAAGGATTCTCATTCAG\AACCTACTTGACTTTGATGCCCTTTACGGCCATC 
GTCGTAATGCTCACGGTTACCGTGATrGCTTGC^^ 
AGAGAATGACCTTCTCTTGATTACTGCGGACCATCGAAATGAC 
ATATATTCCATTGTTGGCCTATAGCCCTGCCTTTAAAGGAAATGGTCTCATTC 
I U CAGCG ACTGTTGCCG ATAACTTTGGTGTGG AAACTGCTATG ATTGGGG AAAGTTTCTTAGATAAATTGGTATAA 

41 29.2 

ATGTTTATTTCCATCAGTGCTGGAATTGTGACATTTTTACT 

TAGAAAGGC GCAAATTACA GGCG\GCAGAT GCATGA GGATGTCAAACAGCATCAGGCAAAAGCn}GGACTCCTA 
1 5 CAATGGGAGGTTTGG TTTT CTTGAT TACIT CTGTTTTt}GTTG L 1 11 Ul ITr CGCCCTATTTAGT AGCCAATTCAGCA 

ATAATGTGGGAATGATTTTGTTCATCrnXjTCTTGTATGGCrrGGTCGGATT^ 
CGTAAAATCAATGAGGGGCTTAATCCTAAGCAAAAATTAGCTCTTCAGCITCTAGGTGGA 
TCTAT GAGCGCGGTG GCGATAT CCTGTCTGTCTTTGGTTATCCAGTTCATT^ 

Li 1 1 i CTGGC TAGTCGUn i 1 LCAAACGCAGTAAACTTGACAGACGGTGTTGACXjGTTTAGCTAGTATTTCCGTTGT 
GATTAGTTTGTCTGCCTATGGAGTTATTGCCTATGTGCAAGCTTCAG 

TTGGTGGTTTGCTCGO i n c r I CATCTTTAACCATAAGCCTGCCAAGGTCTTTATGGGTGATGTGGG AAGTTTGGCC 
CTAGG TGGGATGCTG GCAGC TATCTCTATGGCTCTCCACCAAGAATGGACTCTCTTGATTATCGGA^ 
t m iGAAACAACTTCTGTTATGATGCAAGTC^GTTATTTC 
GCCTOTACATCACCATTTTGAGCTTGGGGGATTCnrCTCGTAAAG 
ZJ TTCTTTTGGGGAGTGGGACTTCTAGCAAGTCTCCTGACCCTAGCAATT^ 

41 33.1 

TTGTTTAAGAAAAATAAAGACATTCTTAATATTGCATTGCCAGCTATGGG^ 
- n GAATGGTGGACAGTTA TTTGG TTGCTCATTTAGGATTGATAGCTATTTCAGGGGTTTCAGTAGCT 
iU CACCATTTATCAGGCGATTTTCATCGCTCTGGGAGCTGCTATTTCCAG^ 

GACCAGTCGAAGTTGGCCTATCATGTGACITjAGGCGTTC 

GTCC ATCTT CGCnKjGAAAGAGATGATAGGACTT TTGGG GACGGAGAGGGATGTAGCTGAGAGTGGTGGACT 
TCTATCTTTGGTAGGCGGATCGATTGTTCTCTTAGGTTTAATG 

AATCCACGTCTGCCTCTCTATGTTAGTTTTTTATCCAATGCCTTGAATATT C1 Ml 1 l CAAGTCTAGCTA n ' ri ' lGl 1 
J J CTGGATATGGGGATAGCTGGTGTTGCT TGGGG GACAATTGTGTCrC O 11 ' 1 IrO 11 G CTCTTGTG A IT 11 0 1 b OTCAC 

AATTAAAACTGCCTTATGGGAAGCCAAL1 1 1 1 UO 1 TrAGATAAGGAACTGTTGACCTTGGCTTTACCAGCAGCTGG 

AG AGCGACTTATG ATGAGGG CTGG AGATG TAGT G ATCATTGCCTTG Ci 1 (JO I ' l ' lCTl T 1 G GGACGGACGCAGTTGCT - 

GGGAATGCAATCGGAGAAGTCTTGaCCCAGTTTAACTATATGCCT 
. n TGTTGGCCCGAGCAGTTGG AG AGG ATGATTGG AAAAGAGTTGCTAGm^ 

4U GTTCCTCATGTTGCCCCTGTCCTTTAGTATATATGTCTTGGGTGTACCATTA^ 

TAGCGGTGGAGGCTAGTGTTCTAGTGACACTGTTTTCACTACTTGGGAC^ 

TACGGCACrrCTGGCAGGGATTAGGAAATGCACGCCTCCCTTmATGCGACAAGTATAGGAATOT 

ATTGGGACAGGATATCTGATGGGGATTGTGCTTGGTTGGGGCITGCCT 

ATGO 1 1 1 1 1 CGCTGCITATTTCT ACGCT ATCGTT ACCAGCGCT AT ATGACHTTTG AAAGG ATAG 

45 

4133.2 

ATGCAAACTCAAGAAAAACACTCGCAAGCAGCCGTTCTTGGCITG 

ATCCTGGTTCCCATCATGATTGCGACAGCCCTTGGCTATTCAGCTCAGCAGTTGACCT 

TCTTCATGTGTGGGGTGGCAACCTTCCTCCAACTCCAACTCAACAAATA 

TGGAGTTCCATTCCAGTCGGTCGCTCCCTTGATTATGATTGGGCAAAGC 

CTTATCGCATCTGGGATTTACGTGGT TCTTG TTTCAGGCATCTTCTCAAAAGTAGCCAATCT 

AACAGGATCTGTTATTACCACGATTGG I 'ri'AACCTTG ATCCCTGTCGCTATTGGAAATATGGG AAATAACGTTCCA 

GAGCCAAOXiGTCAAAGTCTCTTGCTrGCAGCTATTACT 

GATTTA TCAAOTC TATCTCTATTTTGATTGGTCTGG T'l U 1'ltjGAACTGCCATTGCTGCTACTATGGGCTTGGTGGAC 
JO TTCTCTCCnTTTCCCKrrAGCTCCACTTGTCCATCT 

ATCTATTGTCATGATGTGTATCATCGCAACGGTGTCTATGGTTGAGTCAACTGGTO 

TCACAAAGGATCCAATCGACAGCACGCGCCTTCGCAACGGATACCGCGCAGAAGGTTTGGCCGTACTTCTCGGAG 

gaatcittaacaccttcccttacaccotattttcacaaaa 
cctgccaatctactacgcagctggtttcctggttctccttggactgcttcct 
ou ttccaagctccgtcctcggtggtgccatgctggtaatgtttggttttgtatc^ 
ccgtgttgactttgctaacaatgaacacaacttcctta 

aacagtaatctctttgtcagcatgccgacagccttccaaatgttcttctcaaacggaatcgtcgtagccagcctac 
tcgctattgtcctcaatgccgtattaaatcataaaaagaaataa 

65 4136.2 
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ATGAAAGATAGAATAAAAG^TATmCAAGA 
AAAGACAGTTCCAAGGATTTTCGTGAGTTGATrAAAAC^ 

AAGAAGA TGGT A GTCTG ACATTAGAAATTAAGAAAAAACATGAGATTACCCTCAAGGGGATTTT^ 
AAAATGGCTTTGGCTTTGTTAGTCTGGAAGGCGAGGAGGACGACCTTTTTGTAGGGA 
5 T ATTG ATGGTG ATACCGTCG AGGT AGTG ATTAAG AAAGTCG CTG ACCGCAAT AAGGG AACAGCAG CAGAAGCCAA 

AATTATTGATATCCTAGAACACAGTTTGACAACAGTTGTCGGGCAAATCGTTCTGGATCAGGA^ 
GCTGGCTATATTCGTTCAAAAAATCAGAAAATCACTCA^ 

GG AACAG AAGTTCTCAAAGTCTTTATCG ATAAATACCCAAGCAAG AAACATG A 1 ' ITLT VIU I 'CGCGAGTCTTCTCG 
ATCTAGTGGGACACTCAACGGATGTCGGAATTGATGTTCTTGAGGTCTTXjGAATCA^ 
10 TCCAGAAGCTGTTGTTAA GGAA GCAGAAACTGTGCnXSATGCTCCCT 

TCTAAGAGATGAAATTACCTTTACCATTGACGGTGCGGATGCCAAGGACTTGGA 

TCTGAAAAATGGCAATCTGGAGTTTGGGGTTCACATCGCAGAT G IT1 Ci ' 1 ATTATGTG ACCG AGGGGTCTGCCCTT 
GACAAGGAAGCCCTTAACCGTGCGAC 11 ClU ni'ACGTGACAG ACCG AGTGGTGCCAATGCTTCCAGAACG ACTA 
TCAAATGGCATCTGCTCTCTG\ATCCCCAAGTTGACCGCCTGACCCAGTCTGCTATTAT 

15 GGAGATTGATAAACATGGTCGTGTGGTCAACTATACGVTTACACAAACAGTTATCAAGACCA G ' t ' I'l' l tlXj I'ATGACC 

TATAGCGATGTCAATGATATCO^GCTGGCGATGAAGAAAA^ 
GAACTCATGGCCAAGCTTCATGAAACTTTAGAAAACATGCGTGTGAAACGTGGAG 
AAGCGAAG ATTTTA GTGGATAAACAAGGTAAGCCTGTTGATATCGTTCTTCGGCAGCGTGGTAT^ 
TGATTGAGTCTTTTATGTTGATGGCTAATGAAACAGTTGCCGAAC 

20 CG AATTCACG AGG AGCCT AAGG CTG AAAAGGTTCAGAAGTTTATTGATTATGCTTCG AO I'T 

ATGGAACTGCCAGTGAGATrAGTCAGCAGGCACTrCAAGACATCATGCGTGCTGTTXjAGGGAGAAC 
ATGTATTGTCCATGATGCTTCTTCGCTCTATGCAGCAGGCTCGTTATTCGGAGCACAATC^ 
AGCTGCTGACTATTATACTCACTTTACCAGTCCAATTCGTCGTTATCCAGAC C ' rrcr r G ' l 1 CACCGTATGATTCGGG 
ATTACGGCCCrrrCTAACKiAAATAGCAGAGCATmGAACAAGTGATTCCAGAGArrc 

25 GTGAACGTCGTGCCATAGAAGCTGAGCGTGAAGTCGAAGCCATGAAAAAGGCTGAGTATATGGAAGAATACGTGG 
GTGAAGAGTATGATGCAGTTGTATCAA GTATT GTCAAATTCGGTCTCnTTGTCGAATTGCCAA^ 
CTTGATTCACATCACTAATCTGCCTGAATmATC^TrrCAATGAGCGTGATTTGACT 
GGTA TCACTTTCCGAGTGGGTCAGCAGATCCGTATCCGTGTIXiAAAGAGCGGATAAA^ 
mTCATTCGTACCTAGTG AGTTTGATGTGATTGAAAAAGGCTTG AAACAGTCTAGTCGT 

30 AGTGGCAGAGGGCGTGATTCAAATCGTCGTTCGGATAAGAAGGAAGACAAGAGAAAATCAGGACGCTCAAA 
TAAGCGTAAGCATTCACAAAAAGACAAGAAGAAAAAAGGAAAGAAACCTTTTTACA^ 
GAGCCAAGCATGGCAAAGGGCGAGGCAAAGGTCGTCGCACAAAATAA 

4137.2 

35 ATGGGCACAACAGGATTTACAATAATTGACTTAATTATCTTGATTGTTTAT 

CTATTTCTCTAAAAAAGAGATGAAAGGAAAAGA Cj 1 i Cl'I r AAAGGAGATGGTTCGGTTCCTTGGTATGTTACTrCG 
GTATCCATTTTTGCCACAATGCTCAGTCCGA 1 1 1 LL ' riCl ' l GGGACTCGCTGGTAGCTCTTATGCAGGTAGCrGGAT 

tttatggtttgctcaattagggatggtagtagctattccactgacaa mcg l' l 'l 1 atcttacct atctttg cacgga 
. _ taga catcgatacggcatatgattacttggataaacgttttaattctaaagcacitcgtattatt^ 
40 gtttattattta tcaa ttgggacgtatctcratcattatgtacctcccatcag 

ttgacatcaatattttgattattttqatgggtgtagttgcaat^ 

ttatggaca gact ttattcaaggtgtgattctgattagtggtgtcgttttagctttatttgta 

taaaggtgg ctttg gtgcagtagcagaaacattagcaaacgggaaattccttgctgcaaatgaa^ 

tcctaacttgcttrcaaactrcatattttaattct 
45 tcaagatttggttcaaccttttactacaacacaa^ 

ttctcacttgcaactgcaacagtcttttacttgattggtacaggcttgta 

atagtgcacctagcaatatccctcaagaccaaatcmatgtactttattc 

aggtttgatcttggcagcgatttatgcagcatctcaatcaactatttca^ 

tggacattggatattcaagatgtcatttctaaaaatatgtcagacaatcgtcgtacgaaaattgcacaa^ 
50 ctctagcagtaggtttattctcaattggtgtttccattgtcatggctcact 

ttcaatacrrttcatgggacttgtacttggtctaot^ 

TAAACA^GGTGCTTATGCAGCGCTGATTGTATCAACCATCGTCATGGTATT^ 

gctgttagctactgggcatattgattgatttcaatctctgtatc^ 

tgg aaat aaagt atctg cacct aaatat ac aacg attc atg at att ac ag aaattaaagcgg attcaagttggg a 

55 AGTTCGTCACTAA 
4138.1 

ATGAAATTTAGTAAAAAATATATAGCAGCTGGATCAGCTGTTATCGTATCCTTGAGTCTATGTGCCTATG 
ACCAGCATCGTTCGCAGGAAAATAAGGACAATAATCGTCTCTCTTATGTtaG 

00 GTGAAAACTTGACACGAGACCAGGTTAGCCAGAAAGAAGGAATTCAGGCTGAGCAAAT^ 
ATCAGGGCTATGTAACGTCACACCGTGACCACTATCATTACrATAATGGGAAAGTTCC^ 
TGAAGAACTCTTGATGAAGGATCCAAACTATCAACrrAAAGACGCTGATATTGTCAATGAACT^ 
TATCATCAAGGTCGATGGAAAATATTATGTCTACCTGAAAGATGCAGCTCATtjCTGATAATGTTCGAACTAA^G 
G AAATCAATCGTCAAAAACAAGAACATGTCAAAGATAATGAGAAGGTTAACTCTAATGTTGCT 

65 CAGGGACGATATACGAGNAATtjATGGTTATGTCTTTAATCCAGCTGATATTATCGAAGATACGGGTAATGCTTATA 
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TCGTTCCTCATGGAGGTCACTATCACTACATTCCCAAAAGCCATTrATCTCCTAGTGAATTAGCACCAGCrAAAGC 

ACATCTGCCTGGAAAAAATATGCAACCGAGTCAGTrAAGCTATTCTTCAACAGCTA^ 

TGTAGCAAAAGGATCAACTAGCAAGCCAGCAAATAAATCTGAAAATCTCCAGAGTCTTT^ 

TTCACCTAGCGCCCAACGTTACAGTGAATCAGATGGCCTGGTCnTTC 

AATGGAGTTGCGATTCCGCATGGCGACCATTACCACTTTATTCCTTACAGCAAGCTTTCTC 

TTGCCAGAAT GGTG CCTATCAGTGGAAC TGGTT CTACAGTTrCTACAAATGCAAAACCTAATC 

TCT AGGCAGTCTTTCAAGCAATCC ITC l'l CT IT AACG ACAAGTAAGG AGCTCTCTTCAG CATCTGATGGTTATATTT 

TTAATCCAAAAGATATCGTTGAAGAAACGGCrACAGCTTATATTGTAAGACATCK^ 

AAAATCAAATCAAATTGGGCAACCGACTCTTCCAAACAATAGTCTAGCAACACCTTCTCC^ 

CG\G GAACT TCACATGAGAAACATGAAGAAGATGGATACGGATTTGATGCTAATCGTATTATCGCTGAAGATGAA 

TCAGGTTTTGTCATGAGTCACGGAGACCACAATCATTATTTCTTCAAGAAGGACTTGACA 

GTGCGCAAAAACATTTAG 

4139.1 

ATGAAAAAAAGAGCAATAGTGGCACTCATTCTACTGClT'n 

TCCAGCAGATTCCACTGGGTGAAGTGCGCTCCTGGATCCCCAATTTCGTTAGCTTGACCTACCT 
TGCAGCCTTTTCTATCTTACAAGATCAGCAGCTGTTATTCGCTGTCATTACTCTGGTTGTCGTG 
GGT ATTT ACATAAACACATGGAGGACT CATT CTGGATGGTCTTGGGTTTGACTCTAATAATCGC 
AAACTTTATTGACAGGGTCAGTCAGGGCTTTGTTGT GGAT ATGTTCCACCTTGA 

ATGTGGCAG AT AG CT ATCTG ACG GTTGGAGTGATTATTTTATTGATTGCAATGCTAAAAGAGGAAATAAATGGAA 
ATTAA 

4139,5 

atgaatacaaatcttgcaagrrttatcgttggactgatcatcgatgaaaacgacc gi 1 1 1 1 actttgtgcaaaagg 
atggtcaaacctatgctcrrgctaaggaagaaggccaacatacactraggggatacggtcaaaggt^ 
cggatatg aagcaaaaactccgcctg a caacc ttagaagtgacrgccactcaggaccaa l ' t ' log 1'tggggacgtg 
tcacagaggttcgtaaggacttgggttn'ctttgtggatacaggccttcctgacaaggaaatc g 1 1g1g tcactcga 
tattctccctgagctcaaggaactctggcctaagaagggcgaccaactctacatccgtcttgaagtggataa 
agaccgtatctggggcctcttggcttatcaagaagacttccaaccrrcttgct 

aaccaa^crggccagccattgttt accgtctcaagctgtcagg aac 11 ' 1 ' i u h ' 1 acct accag aaaat aatatgc 

rrggttttattcatcctagcgagcgttacgcagagccacgtttggggcaagtattagatc 

ccgtgaagtcgaccgcactctgaacctctccctca^ 

a tttt gacttatttggaaagcaatggcggtttc atga ccttaaatgacaagtcatctc 

cctttggcatttctaaaggtcagttcaagaaagctttaggtggtcttatgaaggctg 

gtttgggacagagttgatttag 

4139.8 

ATG AAAG ATGTT AGTCTA I'll 1 1 ATTGAAAAAAG 1111 CAAAAGCCGCITAAACTGG ATTGTCTTAGCTTTATTTGT 

ATCTGTACTCGGTGTTACCTTTTATTTAAATAGTCAGACTGCAAACTCACACAGCT 

CGCATTGCAGCCAACGAGAGGGCTATCAATGAAAATGAAGAGAAACTCTCCCAAATGTCT 

GAATACCAGTTTGCTAAAAATAATTTAGACGTGCAAAAAAATCTITrcACG 

TAAAAGAAGGGCGCTGGAAAGAAGCCTACTATTTGCACrrGGCAAGATGAAGAGAAGAATTATC 

ATGACCCGACTGCTAGCCCTCGCTTA^ 

TGAACATAAAAGCACATACTTTGGAGTTTCCGACCCACGGGATTGATX1AGATTGTCTGG 

CCCAAU 1'1'IU n'rGTCGT TGCT ATTAl'l 1 11 ATGCT AACACAACTATTTGCAG AAAGATATCAAAATCATCTGG AC 
ACAGCTCACTTATATCC l'U'l'n CAAAAGTG AC ATTTGCAATATCCTCTC7ITGGAGTTGG AGTGGGATATT7TAACTG 
TGCTGTTTATCGG AATCTGTGG C i ITllTnT CTAGTGGGAAGTCTGATAAGTG G ' lTnG GACAGTrAGATTATCCC 
TACCCAATTTATACCTTAGTGAATCAAGAAGTAACTATT 

T AGCTTTCTT AGCCTTT ATCGTCATTGTGG AAGTTGTGTACTTG ATTGCTTAC 1T1 11 CAAGCAAAAAATGCCTGTC 
CTU 1 1 CM' l CAC TCATTGGGATTGTTCGCrTATTGTTTGGTATCCAAACCATTCAGCCT 

TCTG ATTCCCTTTACTTACnXjCGT 1 CAGTGG AGATTTTATCTGGAAGATTACCTAAGCAGATTG ATAATGTCGATC 
TAAATTGO AGCATGGG AATGGTCT TACTT CCTTGCCTG ATTAT C 1 VI 1 1 O CTATTGGG AATTCTATTTATTGAAAGA 
TGGGGAAGTTCACAGAAAAAAGAATTTTTTAATAGATTCTAG 

4141.1 

ATGATGAAGTTGATATTGGATATTGTTAGTACACCAGCTATTTT^ 

TCAGAAGAAGAAATTACCT GATA TTATTAAAGGTGGAATTAAGAC L ' lMU ' Q M rr G <j ' rri L ' l ' l AGTTGT ATCTGGTG GT 
GCAGGAATTGTACAAAATTCTTT AAA TCCA TTTGGT ACCATGTTTCAGCATG L ' I ' 1 ' 1 1 CATTT ATCTGGCGTTGTG CC 
G AAT AATG AAG CAATTCT AGCTGTAG CTTT AACAACAT ATGG CTCAGCTACTGCAATG ATT ATGTTTGCAGG CATG 
GTGTTCAATATCTTAATCGCTCO' VVl 1'ACTCG ATTTAAAT ATA I'I'I'I'I'I'l AAC AGCGCACCACACTCT ATAT ATGGC 
ATGTATGATTGCGGTCATTTTATCAGTTGCTGGCTTTACTAGCTTGCCTCTCATCTTACTAGGAG^ 
GTATTATTATGAGTATTTCCCCAGCATTTGTGCAAAAATATATGGTTG\ATTAACT 

AGGTCATTTCAG 1 1 CI 1 ' 1 GGG AT ATTGGTTG AGTGO l'l'l' l" ACTGGT AGCCTTATCGGTG ACAAATCAAAATC AACA 
G AGG ACATTAAATTTCC AAAG AGTTTAGC 1 II 1 ' 1 ' 1' ACGTG AT AGT ACTGTT AGT ATT ACTTTATCCATGGC AGTT AT 
TTACATTATTGTAGCTATCrrTGCAGGGTCAGAATATATAGAAAAAGAAATCAGTAGTGGTACAAGTGCT 
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TATGCTTTACAATTAGCAGGTCAATTTGCAGC\GCCGTA 1' n'GTl'ATTTTAGCAGGTGTTCGCCTTATTTTGGGCGA 
AATTtTTTCCAGCCTrTAAAGGTATTTCAGAGCGTCTTOT 

ATACTTATGCACCCAATGCAGTTCTAATTGGATTTATCTCTAG ri ' i ' lUI ' I 'GGTGGTTTAGTAAGTATGGTAATTATG 
ArrccrTCAGGAACGGTTGTTATCTTACCAGCTCrrrGTGCCTCA 
5 GAATGCATCTGGTCGTGTTCGTGGAGCCACTATTTCA 

T mA ATGCCAGTTTTGGGACKJACTTGCrrrTCCAAGGATCAA 

ATTTTAGGAATGTTAAATCAATTTGGCTCACAAGCAGGCATTGTGATTGGTCl'lG MCI 1 ATTCTAGCAGTTATGTT 
TGGAGTATCCTTTATTAAAAAGCCATCTGCAACGGAGGAATAA 

10 4142.3 

ATGATTAAAACATTTCTCTCTGCCCrTTCGGTCA T T C ! C 1 1 l' I CTATCCCTATCATAACTTATT C IT ' It 'CCATCT 
TCTAATCT TAAC ATTTGGCTAT CTACCCAA CCTATCTTGGCACAGATTTATGCCTTCCCCTTAGO 
GGCT GCTATTTTAA Cin ' l Cl ' l 'A l 11 l ' l ' l i ' l CCTAT L I ' M ' 1 ' 1 'ACAAG AAAAATAAAG\AATACGGTTTTACTCTGGCA 
TTTTGCTCTTACTATCGCTCATATTACTATTATTCGGAACAGATAAAACC L ' rr ICl ' lH G CATC AAATAAG ACT AAA 
1 5 ACCTTAAAATTAGTAACITGGAACGTCGCTAATCAAATAGAAGCAGAAGATATTOAGCGAA I ' ll ' l 1 AGCCATTTTG 

A CGCC GATATGGCrATA TTCC CTGAACTAGCTACCAATATCAGAGGTGAGCAAGAAAACCAGAGAATCAAACTAT 
TGTTTCATCAAGTTGGACTTTCTATGGCCAACTATGATATTTTCACrrCTCCAC 

GTGACTGTG ATTGTCAAG AAA AGTTATGGTTTCTATACAG AAGCTAAAA C I ' l l I CATACAACACGGTTCGGG ACAA 
TTGTATTACATTCGAGAAAACAAAATATACCAGATATCATTGCCTTGCATACT 
20 GGAAATCTGGAAGCAAGACrTAAACATCATTCATAATCAATTGGCTTCAA 

GATTTTAATGCAACTATGCGTCATGGAGCACrrGCAAAAATAAGCTCTCATAGGGAC^ 

CCrTTTG AAAG AGG AACTTGG A ATAGCCAAAGTCCAAAA C 1 1 1 1 1 I ' l 'AATGCAACAATAGATCATATTTTATTGCCTA 
AAAACCACTACT ATGTTAAAG ATTTAG ACATTGTAAGTTTTCAAAACT ATGTA 1 ' 1 11 1 ACAGAAAT 

CACATTTTAA 

25 

4142.4 

ATCAATCCAATCCAAAGATCTTGGGCTTATGTCAGCAGAAACCGACT 
TGGTCTTATTGGCCGGAATTTCAGCCTGTTTGACTCTGATGAAGTCC^ 

ATCACTCAATACATCnil 1 CT ATT AAGAAG AT AG AG AATGGTCAGACATTCAAGTTGTCAG A CCTAGCATCTGTA 
30 AGCAAGATTAAGGGGCTGGAAAATGTCTCTCCTGAACn'GAGACGGTCGCAAAACT w 

ACTCGCGAGCAGAGCCTGGAGCGTGATGATmTCAGCTCCAG 
GATTCATCCAAGGATGTAACCTTTACCAGTTCGGCmCAATCrAAAAG^ 
TCCAAGAAAATCCTTATCCACGAAGAATrGGCTAAGAAGAACGGTCTTTC^ 

CTGOTCACrrCTG AATCTGG AAAAGG ACAAACAGTAG AGTTTG AG ATTATCGGCATU 1 l'I 1 CTG GT AAAAAACAAG 
35 AGAAATTCACAGGCTTGTCTTCTGACTTCAGTGAAAATCAAGTCm 

GGGCAATAGTGAAGCrCAAGTCACTGCAGCACGCTTCTATGTAGAAAATCCTAAGGAAATGGACGGACrCATG^ 

GCAGGTAGAAAACTTGGCCTTCWAAAATCAAGGCrACCAAGTCGAAAAGGAAAAO 

AGACTCAGTTGCAACTTrCCAAACCTTCCTGACCA^ 

ATTCTGGTTTTGTCTCTCrGGTTGAGAGAACGGCTCTATGAAGTGGGGATTTTACTTGCAC^ 
40 GCTCGATCTTCCTACAATTCTGTTTAGAGGTAGTTTTGGTATCTCTTXKjAGC^ 
AACGCAATCACAACTTACXTACTCCAAACTCTACT^ 

AAAGCAAGCAGTrrATCAACTAGCATCTTATCTTTTGCAGAATCCTAT O 1 I'll 1 CTAGTTCTGCTT AGTTGCTT ATC 
TGTAGCCCTTTGTTTCCTATTCTTATTTAGAAAATCACCG 

45 4142.5 

ATGTTACACAACGCATTTGCCTATGTTACAAGGAAGTTTTTCAAA 

GGCGAGCTTGAGTTTGGTCGGCTTGTCAATCAAGGGAGCTACrGCCAAGGCTTCrCAGGAGA 
ACCAATAGCTTCTCCATGCAAATCAATCGTCGCGTCAACCAAGGAACGCCTCGTGGTGCTGGGAATATCAAGGGT 
GAAGACATCAAAAAAATCACCGAAAACAAGGCCATTGAGTCn'ATGTCAAACGTATCAACGCTAT CGGA GATTTG 
50 ACTXjGATATGACCTGATTGAAACGCXAGAAACCAAGAAGAATCTCACTGCTGATCGTC 
AGCTTGATGATTACAGGTGTCAATGACTCCTCrAAAGAAGACAAGTTTGTCT 

GAGAGCACITAACCAACGACGACAAGGATAAAATCCTCrTCCACAAGGACTTGGCAGCCAAACACGGCT^ 
GTAGGGGACAAGGTTAAACTCGACTCTAATATCTACGATGCAGATAATG 

AGTGACAATCAAGGGACTCTTTGATGGTCATAATAAGTCAGCAGTAACCTACrCACAAGAACTTTAC 
55 AG CTATT ACAGACATTCACACTGCTGCAAAACTrrATGGATACACAGAAGACACAGCCA 

CI ICI'M CTAACAGCAGACAAGAACTTGGATGATGTTATGAAAGAGTTGAATGGCATCAGTGGTATCAACTGGAA 

GAGCTACACACTCGTCAAGAGCTCCTCTAACTACCCAGCrCTrGAGCAATCrATCTCrGGTATGTACAAGATGGCC 

AACCTCCTCTrCTGCKXTTAGCTTGAGCTTCTCACTTTCTCCTCOT 

TCGCAAGGAAGTGGGAATTCTCCTCTCTATCGGCCTCAAGCAGGCAAGTATCTTGGGTCA 
60 ATTCATC^CCGAATCTATCTTGATTGCTATCrcrccrCTAGrrTCTGCr^ 

CAATTGGAAACACTGTCCTTGCCAATCrrGACTTCAGGTCrrTGCCAAACAGGCTAGTAA 
ACCT TGCTGGTGGTGCVGAAGTAGATGGCnTAGCAAGACCTTGTCGAGCCTAGACATTTCCATTCAGACAT 
CTTTATCATCA 1" riTl GTCCTTGCCITGGTTCTAGTGGTrcrCGTTATGGCGCrrGCrTCAAGCAATCrCCTTAGAA 
AACAACCAAAAG AGCTCTTGCTGGATGGTGAATAA 
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4144.1 

ATGTCACAGGATAAACAAATGAAAGCTCrrrc 

GGGTTGGOAGTTTGATTTTCTGTATTTGGGCTTATCAGGCTGGGATTTTACAATCC^ 

ATCCAGCAGGCAGGCATCrGGGGTCCACCTCTCnTATL 1 1 1 1 1 ACAGATTTTACAGACTCTCGTCCCTATCATTCC 
AGGGGCCTTGACCTCGGTGGCTGGGGTCTTTATCTACGGGCACATCATCGGGACTATCTACAACT 
CTGATTGGCTGTGCCATTAT C 1 ' 1 ' I ' l A TGrAGTGCGCCTATACGGAGCTGC CU ' IUXj 1 CCAGTCTGTCGTCAGCAAGC 
GCACCTACGACAAGTACATCGACTGGCTAGATAAGCGCAATC UlTntM CCG ClTCl 1 1 ATTTTTATCATGATTTG 
GCCCATTAGCCCAGCTGACTTTCTCTGTATGCTGGCTGCCCTGACCAAGATGAGCTTCAAGCGCT 
ATCATTCTGACCAAACCCTTTACCCTCGTGGTTTATACCTACGGTCTC ' IT 1 ' C 1 GGCAAAT 
GCTTTGA 

4144.2 

ATGAGAAATATGTGGGTTGTAATCAAGGAAACCTATCTTCGACATGTCGAGTCATGGA O 1 11 I 1 1 CI T 1 ATGGTGA 

TTTCGCCGTTCCTC ITl ' 1 1 AGGAATCTCTGTAGGAATTGGGCATCTCCAAGGTrLTI CTATGGCTAAAAATAATAAA 

GTGGCAGTAGTGACAACAGTGCCATCTGTAGCAGAAGGACTGAAGAATGTAAATGGTGTTAA 

GACGAAGCAAGTGCCAAAGAAGCAATTAAAGAAGAAAAATTAAAAGGTTATTTGACCAT^ 

GTTCTAAAGGCAGTTTATCATGGCGAAACATCGCTTGAAAATGGAATTAAATTTGAGGTTACA 

AACTGCAAAATCAGCTTAATCGTTCAACTGCTTCCTTGTCTCAAGAGCA^ 

AATTG\CAGAAAAGATTGATGAAGCCAAGGAAAATAAAAAGTTTATTCAAACAATTGCAG(^ 

T C M ICH I ATATGATTC7rGATTACCrATGCGGGTGTAACAGCTG\GGAAGTTGCCAGTCAAAAAGGCACCAAAAT 

TATGGAAGTC Crr 1 * 1 ' I ' l ' 1 CTAGCATAAGGGCAAGTCACTATTTCTATGCGCGG ATG ATGGCTCTGTTTCTAGTG ATTT 

TAACGCATATTGGGATCTATGTTGTAGGTGGTCTGGCTCrCC Gl 1 1 10C1C1 1 1 AAAGATTTGCCATTCTTGGCTCAG 

TCTGGTATTTTGG ATCACTTGGG AG ATGCTATCTCACTG AATACCTTG C 1T/HT ATTTTGATCACTCTTTTCATGTA 

CGTAGTCTTGGCAGCCTTCCTAGGATCTATGGTTTCTCGTCCTGAGGACTCAGGGAAAGCCTTGTCGC 

ATTTTG ATTATGGGTGG l" 1 " 1" I'l" 1" 1 GG AGTG ACAGCTCTAGGTGCAGCTGGTGACAATCTCCTCTTG AAGATTGGTTC 

TTATATTCCCTTTATTTCGACCTTCTTTATGCCGTTTCGAACGATTAATGACT 

TCAAACGGATGATTTAGGGATTTGGAAAACCTTTAAACGTGCCTTATCTTATAAATAG 

4144.3 

ATGACAGAAACCATTAAATTGATGAAGGCTCATACTTCAGTGOSCACXrm 

GACTTAAATGAGATTTTGACAGCAGCCCAGATGGCATCATCTTGGAAGAATTTCCAATCCT 

TACGAAGTCAAGAGAAGAAAGATGCCTrGTATGAATrGGTACCTCAAGAACCCATTCGCC^ 

TCTCnTTGTCGGAGAmGAACCGAGCAGAAAAGGCAGCCCGACTTCATACCGACACCTTC 

GGAAGGTCTCTTGATTAGTTCGGTCGATGCAGCTCTTGCTGGACAAAACGC C1 1 0 1 1 GGCAGCTG AAAGCTTGGGC 

TATGGTGGTGTGATTATCGGTTTGGTTCGATACAAGTCTGA^ 

CCTATTCTGTCTTTGGGATGGCACTGGGTGTGCCAAATCAACATCATGATATC 

ATCrrrCTCnTGAGGAAGAATACCAAGAACAGTCAACTGAGGCAATCCAAGCTTATGA 

ATGCTGGGGCGCGTGCGACCACAAGCTGGACrrCAGCGC^ 

CTAGAAAAAATCTTGAACAGAAGAAATTATTGTAG 

4146.1 

ATGTTAAAACTTATTGCTATTGTTGGAACAAATTCAAAACGTTCT^ 
AACACTTTACTGACAAAGCTGAAATTGAACTTGTTGAAAT 

GCAAGTACCTGCTGAAATATTGGAAATTGCTGCTAAAATCGAAGAGGCAGATGGCGTTA 

GTATGATCACTCTATTCCAGCTO ITT 1 G ATGAGCGCTCTTGCnXKnTGTCTTATGGTATTTACCCAC I'l l IGAACA 

AACCAATCATGATTACAGGTGOTCTTACGGTACGCmGTTCATCTCGTGCCCAA 

GAATtjCTCCTGAAATCAAGGCAAATGTTXTTTCCAGATGAATTCTTGCr 

AGTGGCG AC 1 1UO lUGACCTTGATGTTATCAAGAAATTGGATGCCATCTTTGATGACTTCCGTATCr 1'IUl AAAAA 
TCACAGAAAAATTACGTAATGCACAAGAATTACTTCGCAAAGATGCTG 

4146.2 

ATGAATACCTATCAATTAAATAATGGAGTAGAAATTCCAGTATTGGGATTT^ 

GAAGAAGCCTATCGTGCAGTGTTAGAAGCCTTGAAGGCTGGTTATCGTCATATTGATACTC 

ATGAAGAAAGTGTrtKJrCAAGCAATCAAAGATAGC^^ 

GGAATAGTCAGCAAACCTATGACCAAACTCCTCAAGCnTGGAAAAATCTATAGAAAAACTC 
TGGATTTGTATTTGATTCATTGGCCGAACCCAAAACCGCTCAGAG 

AAU'll 1 OGAG AGCG ATGGAAGACCTCTATCAAG AAGGGAAAATCCXjTGCTATCGGCGTTAGCAA I'l Tit. 1 1CCCC 
ATCATTTCGATGCCTTGCTTGAAACroCAACTATCGTTCCTCCGCT 

TCAAGATCAAGTCGTAGCTTACTGTCGTGAAAAGGGAATTTTATTGGAAGCTTGGGGGCC^ 
ACTGTTTCATAGCAAGCAAGTCCAAGAAATAGCAGCAAATCACGGAAAATCGGTTGCT 

GAGCTTGGCAG AAGGA 11 111 ACCACTTCCAAAATCTGTCACAACCTCTCGTATTCAAGCTAATCTTGATTGCTTT 
GGAATTGAACTGAGTCATGAGGAGAGAGAAACCTTAAAAACGATTGCTGTTCAATCGGGTGCTCCACGAGTTGAT 
GATGTGGATTTCTAG 
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4147.1 

ATGACXTTCCAAAATGCTTGATCCAATTGCrATT 

TGA CAGG CTTGATTCTTG CGGT rTATITGACCATGAAAGAAGCACCTAGAAAGAAGATCATACCA^ 
AGATTTTATCTTAGTAGCCTTTCCCTTGGCrATTTTAGGAGCTCGTCTCTACT 

TAGTC AGAATTTA GGAGA GATTTTTGCCATTTGGAATG G 1 Uj 1 ' r I G GCCATTTACGGTGGTTTGATAACTGGGGCr 
CTTGTGCTCTATATCTTTGCTGACCGTAAACTXIATCAATA 
GATTCCTCAAA GTTT GGCGCGTTGGGGTAATTTCTTTAACCA^ 
TATCTACCTGGCTTTATCCGTGACCAGATGTATATT^ 

TAT GGAAT CTGCTTCGCTTTGC CTTGA TTCTGATTTTrAGACGGAAATGGAAGAGTCTCAGACG 

G GCCm t i ACTTGATTTGGTATGGTTTCGGTCGTATGGTTATCGAAGGTA 1 0 1 ic nvG 

GCTTTCGAGTGTCCCAATGGCTGTCACTTTGTCCTTATCGGTCTCGGTA 

GAAGGCCCCTTACTATATTACAGAGGAGGAAAACTAA 

4147.2 

ATGGGT AA ATTAT CCTCAATCC 1 11 ' 1 AG G AACC GTTTCAGGTGCAGCTCTTGC L' 11 0 1 ' I I'l l AACAAGTG AT AAGG 
GCAAACAAGTTTGCAGTCAGGCTCAAGATmCTAGATGATTTGAGAGAAGA 

TCTGTGAAAAACTGACAGAAGTTAAGGAGCAGGCTACAGA 1 1 1 l OnC l ti AAAACAAAAGAACAGGTTGAGTCAG 
GTGAAATCACTGTGGACAGTATACTTGCTCAAACTAAATCCTATGCTTTTCAAGCGA^ 

AATTAAATAATCTCAAGGAGCAATCGCAAGAAAAAGCCGAAGCTCTTGATGACTCAGAAGAGATTGTGATTGAT^ 
TAACAGAAGAATAA 

4147.3 

ATGAAAACTAAATTGATCTTTTGGGGCTCTATGCTCTTTCTCCTCT 

TTGGA TTTT CTATCCTATGCAGATTCAGTGGCTAAACTTAACGAATCGAGTCT 

TACAATTTTCATATCTTGATGAATTATCTGACCAATCCTTTTACnX> 

AGCAGCTCGTCTG CACCA TTTCGCAGTGGTCAAGAATCTCTTTCATTTGGTrCAGCTAGTAGCT 
CCAA GTTTC TATGTCTTTGTCAATAGGATTGTGAA AAAG GA L 1 I T H Vl 'CTCTTTATCG AAAAAGTCTCCTGGCrCT 
AGT AGTCT TACCrGTGATGATTGGAC TTGGGG GAGTTTTGATTG Ci I riM' GACC\ATTCTTTACTCTTTTCCATCAAA 
TTCTci i iuTGGGAGATGA TACCTGG CTTTTTGATCCAGCCAAGGATCCTGTTATTATGATTTTGCCAGAGACCTTC 
1 1 iui i CATGCCTTCCTCCTCITITTTGCCCTCTATGAAAAC I lLTntXJCTATCTGTATCTGAAAAGTCGTAGGAA 
GTGA 

4149.1 

atgacttatcattttactgaagaatacgatattattgtaattggtgcgggacacgctggg 

ccgctagccgtatgggctgtaaggtcctgcttgcgaccatcaatattgaaat^ 

ctctatcggtggttctgccaaggggattgtcgtgcgtgaagtcgatgccctcggtggcgagatggccaaaacca 

gacaagact taca tccacatgaagatoctaaacacaccgaaggggccag 

gacaaggaactttactctaaggagatgcgcaagacggttgaaaaccaagaa 

a ttgat gagattttggtggaagatggcaaggttgtcggtgtgcgtacagccacc 

gc1g li attgtgacgacagggactgctctccgtggggaaattatcatcggagaccrcaagtactcatcaggtccta 

accacagcttcgcttctattaacctagctgacaatctc^ 

aacccctccacgtgtcaaggcttcttctatcaattacgatgtgacagaaattcagccaggagacgaagtccct 

catttctcatacacttcacgtgatgaggattatgtcaaggaccaagtaccatgctggttg 

cc^gtcatgagattatcxaaaacaacctccaccgtgcgcctatgt^ 

gttactgtccgtcgattgaagacaagattgtgcgctttgcggacaaggaacgtcaccaactcttc 

aggckxicaatactgaggaacrrctatgtgcaaggactttcaa 

gcattccatcaaaggtttggaaaatgcagagatgatgcggacaggttatcctattga 

gtatgatatggtcttgcctcatcagttgcgtgcgacntggaaaccaagaaaatc^ 

cagacaaatggaacatcaggttaotaagaggcagcaggccaaggcattatcck:ggct 

aatccaaggcaagcctgaatnjattttgaagcgcagtgatggttatatcggggtgatga 

caagggaaccattgaaccctaccg fclcl 1 uaccagtcgtgctgaataccgtctcattcttcgtcatgacaatgct 

gatatgcgcttgactgagatgggacgcgagattggccttgtggacgatgaacxktgggct 

AAAA ATCAATTTOATAATGAGATGAAGCGCCTAGACAGTATCAAACTCAAGCCAGTCAAGGAAACCAATC 

CrTTGAGGAGATGGGCTTCAAACCCTTGACCGATCCAGTGACAGCCAAGGAATT 

ACCAAGATGTGGTGGCCTTCATCGGACCAGCTGCAGAAGACTTC 

AAATCAAGTATGAAGGCTATATTTCCAAAGCCATGGACCAGGTTGCCAAGATGAAACGCATGGAAGAAAAACGCA 

ttccggccaatatcgactgggatgacattgattctatcgcaaccgaagcccgtcagaagttcaaactcatcaatcc 
agaaaccatcggccaagccagccgtatttcgggagtaaacccagcagatatttctattttgatggtgtatctggaa 
ggtaaaaatcgtagtatttctaaaactcttcaaaaatcaaaatga 



4149.2 

atgaaagtattagcttttgatacgtccagcaaggctctttctctggctattttaga 
agacgacgattaatattaagaaaaatcacagtattactcttatgcctgccatcgattttrtgatg 
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TTGGACACCCAAGGATTTGGACCGAATCGTGGTAGCTG 
AGCAACTGCTAAGACCTrAGCrCACACCCTGAAC\TCGAC7TTCXrrrGCT 

CATCAACAAG AAGG i i lUi 1 1 GTCCCCTTG ATG G ATGCG CGTCG C AAT AATGTT~ ATCCAGG ATTTTATGAAAATG 
CCAAACCTCnrATGCCAGAAGCGCACCTATCTm^ 

CCITTGTCGGAGAAGTTGGCCC CTTTG TTGAGCAGATTCAAAAACACITGCCAAGG 
GCCCAATGCAGCTAATCTAGCTCTTTrGGCCTGGGACAAGGAAGCAGACTCOT 

CTCAAACGAGTCGAGGCTGAGGAAAACTGGCTCAAGAACCATACCGAGTCTGGCGAGTCTTACATTAAACGCCT 



4149.3 

ATGATAGAAATCAAGCGAATTCAACAACAGCCTGACCTAGCTCAAGCCATCTA 

TAGTCAGTCCTTGGACTCTGGAGCAAATCCAAGCAGATCTGTCCCAAGACCAGACT^ 

TGGGGCAGAAGTGATTX3GATTTCTAGCTGTGCAGGAGAATCTTTTTGAAGCAGAACT 

AGGAGCITATCAGGGTCAGGGGATTGCGTCagCCTTGTTTGCTCAATTGCCGACAGAC^ 

GTCAGACAGTC\AATCAACGAGCGCAAGCATTTTACAAGAAAGAAAAGATGACAGT^ 

CTACTACCATGACCCAGTCGAGGACGCCATTATCATGAAGAGAGAAATAGATGAAGGATAG 

4152.2 

AT GACAA AACAAGTCTTATTAGTGGATGATGAAGAACACATTCTGAAATTGCTTG 
GGCTnTCTACTCAATTGGTGACAAATGGACGGAAGGCCTTAGCTTTGGCAG 
TGCTTGATATCATGTTACCACAATTAGATCKjCATGGAAGTTTGTAAGCGGCTG 
CAATTATGAT^GTTTCTGC^ 

GACGVAGCCTTTTAGCCCTAGAGAATTGCTGGCGCGTGTCAAGGCTGTCCTCAGGCGAACTAAAGGAGAACAAGA 

aggagatgattcagataatatcgctgacgattcttggctatttgggaccttgaaagtataccctgagcgtcatgaa 

gtctacaaggcgaataagttactgagtttgaccccaaaagaatttgaaagcgataaaaatccgtt^ 

tcaaagtttcgaaagtaaccgcccaataa 

4154.1 

atgactacttttaaagatggatttttatggg 

GTGGCAAGGGAATTAGTGTTCCTGATGTTATGACTXjCTGGTCGTCATGGAGTA^^ 

tttagagggtaaatattatccaaatcatgaggcgatagatttttatcaccgttataaaga^ 
gctgaaatgggattcaagtgcttccgtacctctattc 
cgaatgaagaaggattacagttttatgataatctttttgatgaatgcttaaagaatgot 
tctatctcatmgaaatgccttatcactragtg 

tttgctcui i i igcagaagtcgtat ttaaa cgttacaaagataaggttaaatattggatgactttcaatgaaatca 
ataatcaagcgaattatcaggaagattttgcaccatttactaactcaggtattgtatatgagg^ 
agaagca attat gtatcaagcagcacattacgaattagttccttctgcacgagctgtaaaa 
aatccagarrrrcaaataggttgtatgattgcgatgtgtccaatrtatccagttacttc 

AATGGCAATGAAAGCTATGCAGAAGCGTTATTATTTTGCTGATGTGCATGTTT^ 

TTCAAGTATTGG GAAC GAAAAGGTATTTCAGTTGATTTTACn}^ 

TAGATTACATTGGTTTCAGTTACTATATGTCCTTTGCTATCGACTCTCATCGTGAA 

CTTGAAACAGAAGATTTAGTGAAAAATAATTATGTrAAGGCTTCTGAATGCKjACT^ 

TTGCGTTATGCGTTAAATTGGTTTACAGACCACTATCACTTACCACTC^ 

AGATCAAGTTGCAGCAGATGGTATGGTACATGATGATTATAGAATTGAATATCTAGGTGCCCATATTCGTGA 
AAAAAGGCTGTAGTTGAAGATGGTtjrrG ATTT AATGGGTTATACTCCATGGGGATGTATTGA ITlUiV 1 11 CAGCTG 
GTACCGGTGAAATGC GGAAA CGTTATGGCTTTATTTATGTAGATAAAGATGATAATGGGA^ 
GTTCCCCG AAAAAATC ll'V\ GGCTGGT AT AAGG AAGTTATTTCATCTAACGGTGAATCAGT AG AAT AG 

4154.2 

ATGGATCAACAAAACGGOMOl iOMii liCVl G AAAACCATTnTATGGG ACCAATGGGCAAACTTGCTCAGTTTA 

AAGTA GTACG TGCTATC ACGG Cnj CAGGT ATGGCTGCTGTACCATTTACTATTGTAGGATCA A i U i I ' l l ! U GTATT 

CAGTATTTTGCCACAAGCTTTCTCATTTTCGCCAATTGTG 

CACTTTACATGGTTGCAAACTATGCGACTATGGGTTCTCTATCTCTTTATTTCGT^ 

ACX^AAATTTATGCAGAGGAAGAAGAACTCAATATGAATCCTCTrAATGGTC 

TCATGACAGTACCGCAAATCATTTTTGATGGTGGAATGATGAAGACTGTGACAAGTCTAAAA 

TTGCAGATGGATGGGCAATGOT 

CAA TTGTG ACTUi m 1 ATTTA TCGTA TGTGTGTT AAACATAATTGGGTTATTAAAATGCCTG AAGCTGTTCCAG AA 
GG ACTTTCTCGTGGATTTA CCGC' 11 ' 1 IiU l'l CCGGG ATTTGTTGTTGCATTTGTTGTTATCTTTATCAACGGTCn'CTT 
GT AGCAA TGGGAACAGATATTTTTAAAGTCATTGCAATTCCA 1 1 1 OU n ' l ' l ' G TATCCAATCTGACTAATTCGTGGA 
TTGGTTTAATGATTATTTATCTATTGACTCAACT 
TTTGTTAGTCCAATTGCTCTTGCTAACATGGCTGAAAATGCTGCTXj 

CT AAT ATTjTTTGTAATTG caggtggttctg gtg caacttt agg act atgttt atat attg cttttg cctct aaatct 
gaacagcttaaagcaataggacgagcatctgtagttccagccttatt^ 
tacctattatctat aatcc agccttggctataccatttattttagcaccaatggtractgctactattta 
gcgaattctctaaactttattaagccaattatcgcacaggttccatggccaactccagtagggat^ 
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TAGOGA^GCACATCTTCGACCTCTATTAGTTCCTCT 
ATCCGTGTATATCATCAAAAATTGGTCAAAGAAGAGCAAGCTATCTAA 

4155.1 

ATGAAAAAATTTTATGTAAGTCCAATTTTTCCTATTCTAGTAGGATTGA 1 1 GIU I'll OG AGTCTT ATCC ACTTTCAT 
TA IT nTGTl'AATAATAATCTGTTGACGO 1 1 1 1AA 1 1 1101 UlU U \ G TAGGAGGCTATG i 1 1 1 ATTTAAGAA 
ACTOAGAGTGCArrATACAAGGAGTGATGTAGAACAGATACAGTATCTAAACCACXA^ 
AGCrCTATTGGAACAGATGCCTGTAGGTGTTATGAAATTGAATTTATCTTCT^ 

T ATGCTG AATTG ATTTTG A CC AAGG AAG ATG GTG ATTTTG ATTTAGAAGCTGTTCAAACG ATTATCAAGGCTTCAG 
TAGG AAATCCGTCTACTTATGCC AAGCTTGGTG AGAAGCCTTATGCItnTCATATGG ATG C 1 ' ^ CI ' l CCCGTGTTTT 
GTATTTTCrrAGATGTATCCAGGGAACAAGCCATAACAGATGAArrGGTAACAAGTA 
CT CTGT GGATAATT ATGAT GATTTCKjAGGATGAAACTrCTGAGTCA^ 

AATTTT AT ATCAG AG I'l 'I'l CAG AAAAACA CATG A 1 G 1 1 11 CTCGTCGGGTAAGTATGGATCGATTTTATCTATTTAC 
TGACTACACGGTGCTTGAGGGCTTGATGAATGATAAATTTrCTGTTATTGATC 

AGACAGTTGCCCTTGACCTTAAGTATGGG A 1 1 1 1 CI'l ATGGCGATGGAAATCATGATGAGATAGGGAAAGTTGCTT 
TG CTCAATTTG AACTTGGCTGAAGTACGTGGTGGCG ACCAG O IGG T IGT I'AAGGAAAACG ACGAAACG AAAAATC 
CAGTTTA i'i'Vl tJGTGGTGGGTCTGCTGCTTC AATCAAG CGT ACACGG ACTCGTACGCGCGCT ATG ATG ACAGCTAT 
TTCAGATAAG ATTCGGAGTGT AGATCAGG H ' H ' IG l 'AGTCCGTCACAAAAATTTAGACATGGATGCTTTGGGCrCT 
GCTGTAGGTATGCACTTTGTTCGCCAGCAATGTGATTGAAAATAGCTATGC i L 1 1 1 ATGATGAAGAACAAATGTCTC 
CAGATATTG AACG AGCTGTTTCATTCATAG AAAAAGAAGG AGTTACGAA G 11 G II G 1 1 1 f G I T AAGG ATGCAATGG 
GGATGGTGACCAATCGTl ClTl G i'lGATTCTTGTAGACCATTCAAAGACAGCCTTAACATTATCAAAAGAATTTTA 
TGATTTATTTACCCAAACCATTGTTATTGACCACCATAGAAGGGATCAGG A 1 ' IT 1 L 'CAGATAATGCGGTTATTACT 
T ATAT CGAAAGTGGTGCAACTAGTGCCAGTGAGTrGGTAACGGAATroATT^ 

CGTTTGAGTCGTATGCAAGCAAGTGTCITGATGGCTGGTATGATGTTGGATACTAAAAATTTCACCTCGCGACT 

CTAGTC GGAC ATTTGATGTTGCTAGCTATCTCAGAACCCGCGGAAGTGATAGTATT^ 

GACAGATTTTGAAGAATATCGTGAGGTCAATGAACITATTTTACAGGGGCGTAAATTAGGTTCAGAT 

GCAGAGGCTAAGG ACATG AAATGCTATGATACAO 1TG H' ATTAGTAAGGCAGCAG ATGCCATGTTAGCCATGTCA 

GGTATTGAAGCGAG 1 1 1 1 G 1 1 C 1 1 GCGAAGAATACACAAGG ATTTATCTCTATCTCAGCTCGAAGTCGTAGTAAAC 

TGAATGTACAACGGATTATGGAAGAGTTAGGCGGTGGAGGCCACTTTAATTTGGCAGCAGCTCAAA 

TAACCTTGTCAGAAGCAGGTGAAAAACTGACAGAAATTGTATTAAATC 

TGA 

4156.1 

ATGAAAC AG AAAAATATGTGG AAAG AATTGTTGAATCGTGCAGGCTGG A 1 ' MTG GT Cn I ' l l ACTTGCCGTCCTTT 

TATATCAGG^CCCCTAGTGGTTACCTCTATITTCACTTTAAAAGAAGTAGCCCTGCT 

TGCrGGCCTTTCAATTGTGGTrCTGGCTCTATTTATT 

CTTnTTTAGAGCTAAAGArTTGGCACGm 

ATTTTATTGCAACTGTCAAATGAGACGACAACAGCTAACCAGTCTCAGATTAATGATAT 

TGAT TTCCA G 1 ' I ' l C 1 ' 1 ' C n GCTAGC t ' l ' IXiCl ' I 'GCTCCGATTTGTGAGGAAAT L ' 1 ' 1 U f GTCGTGCGATTGTTCCTAAA 
AAGArTTrCCGAGGCAAGGAGAACTTGGGATTTGTAGTCGGTACGA l'lXi ITj 1TIGC1 U ATTGCATCA ACCAA GTA 
ATTTACCTTCTTTATTGATTTATGGAGGTATGTCGACAGTTCTATCTTtKjACAGCCTA 

AATGTCGATCTTGCTTCACATGA l 'l Gl ' l AATGGGATTGCTTTCT G ITIUTIXj C ITI ' I 1 G TGGTGATTATG AGTCGG A 
CATTAGGAA1 1 1 (JIG 11' 1AAATGAAAG AG AAAAATATGTGGAAAGAATTGTTGAATCGTGCAGGCTGG ATTTTG 
LI I ' l^l ACrTGCCGTC Cll ' l ' r ATATCAGGTTCCCCTACrrGGTrACCTCrA l 1 1 1 GACTTTAAAAG AAGTAGCCCTGC 
TACAGTCA GGGCT G ATAGTT GCTGGCCTTrCAATTGTGGTTCTGGCTCTATTTATTATGGGAGCT 
GTTAGCTA G ' ri ' n 'AA l ' l ' l ' l ' l CI I ' l 11 1 1 AGAGCTAAACATTTGGCA (JGTn\i GGCTTGAGTTATCTAGTTATTGTCG 
GGTCAAATATACrrGGTTCCATTTTATTGCAACTGTCAAATGAGACGACAACAGCTAAC 

TATGGTTCAAAATAGTTCGTTG ATTTCCA G ' I ' l ' lCTrCIT GCTAGC C I T UC i ' l G CTCCGATTTGTG AGGAAA TCITG T 
GTCGTCGGATTGTTCCTAAAAAG ATTTTCCG AGGCAAGG AGAACT7GGG A 1 1 1G1 AGTCGGTACGAT1U1G1 1 IOC 
TTTATTGCATCAACCAAGTAATTTACC 1 1 (JIT I ATTGATTTATGGAGGTATGTCGACAGTTCTATCTTGG ACAGCCT 
ACAAGACCCAACGTTTGGAAATGTCGATCTTGCTTCACATGATTGTrAA^ 
GTGGTGATTATG AGTCGGACATTAGG AA 1 I lUOl H AA 

4156.4 

ATGGATACACAAAAGATTGAAGCGGCTGTAAAAATGATTATCGAGGCTGTAGGAGAGGACGCTAATCGCGAGGGC 
TTGCAGGAAACACCTGCTCGTGTAGCCCGTATGTATCAAG AG A I ' 1 111 1 CAGGTCTTGGTCAAACAGCAG AGG AAC 
Al 1 1 KS 1 ' CA AAATCCTT roAA ATTATTGACG ATAATATGGTGGTAG AAAAGGATAT L ' l ' 1 ' 1 ' 1 1 CCATACCATGTGTG A 
ACACCACTTCTTGCCATTTTATGGTAGAGCGCACATTGCCTACATrcCAGA 
CTAGCCCGTACGGTTGAAGTTTATTCGAAAAAACCACAAATTCAAGAACGTTTGAA 
TTGATGGACTATCTAGGTGCTAAAGGAGCCTTTGTTGTCATTGAGGCGGAACATATGTGTATG 
TTAGAAAACCAGGCACTGCAACCTTGACGACAGTAGCTCGTGGTCTATTTGAAACAGATAAGGATCTCCGTGACC 
AAGCTTATCGTTTAATGGGGCTATAA 

4157.2 
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ATGAAACACTTCrrTTTTAAAGAGAA^ 
ACCACTTTCTCTTGTTCCTGCITG^ 
GAAAATCATTGGCCTATCCTnTGT^ 
TATGGAGGCTCCAGACAAGCTCmCTCrrAGTTG^ 
J CATTTCCCTAG TCTTTT GGCTCTTTGTACAGACC L 1 1 1 I L i TXj CTCTTATTTCCGCCTTTA ITTT ' I ' AGCAATGGGTTA 

TGGCTTGCC AGTTTTTCTG CTCT ATGTGCTTTTATTGGGGGTAGG AAAATATTTCCACTTTTCT C CAGCA 

AATTTTTCACTGA^CTGGACTGGACTGGGACT^ 
TTTCTTTGCCCTCTTTACGCAGGTCAAGGGAATTTCAAACAGCGTTAAGC 
AGGCTGTTCAGAAGGTGCCTGGGAAGATTTGGCAAAATCTCTATCTGCGTT 
1 0 TGCTCTCAGT CTTC GTCTTCTCTTGCTTTCCTrGCTGGCGCAGC) i I 1 1 1 ATCGAGCAAGCTTGG ATTGCG ACAGCAG 

TGGTA GTTC TCTTTAACTACCTCTTGCTCrrCCAGTTtKnTXjCC 

CAA CTCTT TCCGCTGGACAAGGGGCAAAAGGAAAAAGGCTTACAGGAGGTAGTTCGAG^ 
TT ACTTG TGGAATTAG HG 1TGGGTTGATTACCTTCCAAGAAAAACTAGCCCTTCTAGCCTTACTA 
TGGTTTTACrAGTCTTGTATTTGCCTTATCAGGTAAAACGTCAGATGCAGGACTAA 

15 

4158.1 

ATGAGAAAATCAATAGTAT TAGCG GCAGATAATCCCTATCTTATTC 

ATCACAATAGAGATGTTG A 1 1' l' ri'ATATTCTCAACAGTGATATAGCTCCTGAATGGTTTAAATTATTGGGG AGAAA 
AATGG AAGTTGTG A^TTCT ACAATTCGCAOTGTACACATTG ATAAAG AAC 1TTTT GAAAGCTAT AAAACAGG ACCT 
20 CATATAAATTATGC ITU 1 1 ACTTTAGA III' lU'lliCGACAGAAGTGGTTGAATCTGATAGGGTATTGTATCTGGATTC 

CG AT ATCATTGTAACTGGGGAACT AG CTACTTTGTTTC AG AT AG ATCTCAA^ 

GATGTCTATGCCTATGAAGGACGAAAATCTTKjATTTAATACTGGTATGTTA 
AACATrCTATTt3TCAATACnTrATTGGAATTAGCGGCCGAGCAG 

TTTAAATATTTATTTTGAGGATAATTGGCTAGCCTTAGATAAAACATATAATTATATGCT 
25 ACCTTGCTCAAG AATGTG AACGTCTAG ATG ACAATCCACCTACAA 1' I tf IT CACTATGCrAGTCATG ATAAACCTTG 

GAATACATATAGTATATCTAGACTACGTGAATTATGGTGGGTTTATAGAGATTTGGArrGGTCAGA 

CAACGTTCCGATTTAAATTATTTTGAAAGAAGCAATCAGTCrAAAAAACA^ 

GATATAAAACATTTAGAGTATTTAGTACAACGGTTACCTGATTC^ 

TGAGGAGCTGACCTCTCTATCAC^GTATACGAATGTAACAGTATATCAAAATGTATTA 
30 CTATTGGACGATTCTATAGTTT ATTTA GATATTAATACAGGTGGAGAG O 1111 1 AATGTAGTTACAAGGGCACAAG 

AAAGTGGCAAG AAAATCTTCGC 1 1 1 ' l'l G ATATCACACGTAAAAGTATGGATGATGGACTCTATGACGGT ATT ITTT C 

TGTGGAGAGACCAGATGATTTAGTGGATAGAATGAAGAATATAGAGATAGAGTAA 

35 ATGACTAAGATTTATTCGTCAATAGCAGTAAAAAAAGGACTATTTAC 

GAAGTCGTATTATTCTCC UTTT UTT 'GACCTAAATACTAAAG ATTTT ' IT 'AGGAGGTTCAACAGCCTATCTACCCTTC 
TCA GCCG CCCTAACA GGTGGGAATCT AAGAAGTTTATCAA rn ' n ' r CTGTTGGATTATCCCCTTGGATGTCCGCCA 
TGATTTTATGGCAGATU ITT I CITTTT'CTAAACGGTTGGGTTTAACATCTACGTCTATAGAAATACAAGATCGCCGT 
AAAATGTACCTGACCnTGCTAATTGCTGTGATTCAATCCITGGCAGTTAGCTTGAGACTGC 

40 ATTCTGCAATATTG G 1 ' 1 G ITCTAATGAATACAATATTGCTG ATACCAGGAACA ITTT ' U U ' 10 1 1 ' 1 GGTTGTCAG AT 

TTAA ATGCGAGTATGGGGATTGGAGGTTCTATTGTAATCCTCCTATCCAGTATG Ol 1 1 1 AAATATTCCTCAGG ATG 
TTTTGG AAAC^TTTCAG ACAGT ACACATTCC\ACAGGG III 1CT 

TATTTACTTGCCCTTATGTATCGAGCTCGCTATTTGGTTCCTGTTAATAAAATTW 

CTA1TCTTATCTCG AAATCATGTrG AATCCTGGVGGTGGGATGCCTTATATGTATGTG ATGA U 1 ' ITT U 1 AGTGTAC 
45 CAGCTTATTTGTTCATCTTGTTGGGATTTATTTTCCCTAATCAT^ 

GTTGGAAAGCCTTTCrrGGGTCTATGTTTAT ATTTCGGTCTT A Mill ATTTAGTATC A 1' ITTT G CT TTT 0' IT ACG ATG 
AATGGAGAAGAGATTOC^GACCCTATGAAAAAATCrGGAG 

CAT GUTT TTOl 1 11011 1 G ATG AAAAGTT ATTACG ATTGGCAATG ATTCCAGGCTT^ 
50 TGATTmACGATTAGAGACGAGGTCAAGGCmAvVGGCTAAATGAGACCTATAGACCm 

4158.3 

A TG fCUU C 1 1 IT C GOATCAA GAAT TAGTAGCTAAAACAGTAG AGTTTCGTCAGOj IV V V I CCG AGOG AG AAAGTC 
TAG ACG AT A 1 1 1 \XMJ ITUAAGCl ITT GCTGTGGTGCGTGAAGCAGATAAGCGGATnTAGGGATGTTTCCTTATGA 

55 TCTTTCAAGTCATGGGAGCrATTCrrCATGCACTATGGAAATGTrGCT^ 

GACAGCTACCATGCCTGTCTATTTG AACGUT T T T XTAGG AGAAGGAGTG ATGGTTGTGACTCCTAATG AGTATTTA 

TCAAAGCGTGATGCCGAGGAAATGGGTCAAGTTTAT LXi 1' ITT U 'AGG ATTGACCATTGGTGTACCATTTACGGAAG 

ATCCAAAGAAGGAGATGAAAGCTGAAGAAAAGAAGCTTATCTATGCTTCGGATATCATCT 

ATTTAG O ITTT G ATTATCTAAATG ATAACCTAGCCTCG AATG AAG AAGGTAA G I'll 1 1 ACG ACCGTTT AACT ATGT 

60 GATTATTGATCAAATrcATGATATCTTGCTrGATAGTGCACAAACTCCTCTGATTATTGCGG 

AGTCTAATTACTATGCGATCATTGATACACTTGTAACAACCTTGGTCGAAGGAGAGGATTATATCTT^ 
GAAAGAGGAGGTTTGGCrCACTACTAAGGGGGCCAAGTCTGCTGAGAArrTCCTAGGGA 

GG AAG AGCATGCGT LTT TT G CTCGTC\TTTGGTTTATGCGATTCG AGCTCATAAGCTCTTTACTAAAG ATAAGG AC 
TATATCATTCGTGGAAATGAGATGGTACrGGTTGATAAGGGAACAGGGCGTCTAATGGAAATGACTAA^ 
65 GGAGGTCTCCATCAGGCTATTGAAGCCAAGGAAC^TGTCAAATTATCTCCrGAGACGCGGGCTATGGCCTCGATC 



SUBSTITUTE SHEET (RULE 26) 



WO 00/06737 



49 



PCT/GB99/02451 



ACCTATCAGAGIU I riTAAGAlbl I 1 AATAAGATATCTGCTATGACAGCCACACGTAAGGTCGCGGAAAAAGAG 

TTTATTGAAACTTACAATATGTCTGTAGTACGCATTCCAACCAATCGTCCGAGACAACGGA 

TTGACTATCCACATAATCTATATATCACTTTACCTGAAAAACTTGTATGCATCCnXW 

TGCTAAGGGAAATCCTTTACTC G riU ' I^ GTAGGCTCAGTTGAAATGTCTGAACTCTATTCGTCTCT C r i ' G ' rritJGI G 

AAGGGATTGCCCATAATGTCCTAAATGCTAATAATGCGGCGCGTGAGGCTCAGATTATCTCCGAGTCAGGTCAG 

TGGGGGCTGTGACAGTGGCrACCTCTATGGCAGGACGTGGTACGGATATCAAGCTTGGTAAAGGAGT 

TTGGGGGCTTGAHGIM'ATTGGGACTGA GCGGAT GGAAAGTCAGCGGATCGACCTACAAATTCGTGGC 

TCGTC AGGG AG ATCCTGGT ATG AGT AAA I'l l 1 1 IGTATCCTTAGAGGATGATGTTATCAAGAAA 1 1 IGGICCATCT 

TGGGTGC^TAAAAAGTACAAAGACTATCAGGTTCAAGATATGACTCAA 

CGGAAACTACTTCGAAAAGGCTCAGCATGCCAGTGATACrrGCrGGACGTTCAGCAC 

GCTGAAAGTATGAATATACAACGGGATATAGTCTATAAAGAGAGAAATCGTCTAATAGATGGTTCT 

GAGGATGTl'G 1'lGTGGATATGATTGAGAGATATACAGAAGAGGTAGCGGCTGATGACTATGCrAGTCGTGAATTAT 

TGTTTCACTTTATTGTGACCAATATTAU I' 1 11 CATGTTAAAGAGGTTCCAGATTATATAGATGTAACTGACAAAACT 

GCAGTTCGTAGCTTTATGAAGCAGGTGATTGATAAAGAACTTTCTGAAAAG 

TATATGAACAGTl" 1" I" I ACG ACTTTCAClXiClM'AAAGCCATTG ATG ACAACTGGGTAGAGCAGGTAG ACTATCTACA 
ACAGCTATCCATGGCTATCGGTGGTCAATCTGCTAGTCAGAAAAATCCAATCGTAGAGTACTATCAAGAAGCCTA 
CGCGGGCrrTGAAGCTATGAAAGAACAGATTCATGCGGATATGGTGCGTAATCTCTO 
CACTCCAAAAGGTGAAATCGTGACTCATTTTCCATAA 

4128.4 

ATGATAGGGACTTTCGCCGCTGCTCTTGTAGCTGTACTAGCAAATTTCATCGT 

GTGCCAATACTCAAATTGCACCACCAGATGGGATTGGGCAGGTTCTCAGGAACL'IV 

ACCCAGTCAACGCCCTGCTTACTCCTAACTATATTAGAATCTTATCTTGC^ 

AGAAGCCAGTAAAAATAGTCAAGAATTGCTAAAAACrATCGCTGACGTGACTTCTAAAATTGTCGAA 

CAATCTGGCTC CATTTGG AATCCTTCGTC 11011111 AAAACCATTTCTG ACAAGGG AGTCGG AAGCCTTGCCAAC 

TACGGTATTTTATTGGTTCTATTAGTAACGACTATG C ' 1 ' IT ri GTTGCCCCTGTGGTCAACCCTTTGATTGCCTTCTTC 

TTrATGAGACGCAATCCTTACCCTCTAGTTTGGAACTGCCTCCGTGTCAGCGGTGTGACAGCCrri 1 1CACTCGTA 

GTTCTGCGACTAACATrCCTGTCAACATGAAACTCTGCCATGACCTTGGACTC^ 

TATtXXACTCGGTTCTACTATCAATATGGCTGGAGTAGCGATTACCATTAACCTTTrGACC 

ACTCTTGGAATTCCTGTTGACTrrGCCACAGCCTTTGTCCTGAGTGTGGTA 

AGGTATTGCCGGAGCrrrCCCTCCITCTTATCCCAGTTGCTTGTAGCCTTTTC 

AAATTGTTGGGGTTGGTTTTGTGATTGGTGTCATCCAAGACTCATGTC 

CCTCTTTACCGCCGTTGCCGAATACGCAGCAACCCGTAAAAAATAA 

4158.5 

ATGTCTATTAGCCAACGTACGACCAAGCTCATCTTAGCTACCTGTCTTGC 

TTCGTCAGCAGTTTCGGCTGGAATTATCGCTCTCTTGAGCCTATCTGATACGCGTAG 

CGCAATCGT L ' 1 ' I ' I ' H T CTATGCTTCTAGCTCTGGCTATCGGTGTTCTAGCTTTTCACTTG 

GAGTCTCGGCCTCTATCTGGCCTTCTACGTTCCTTTAGCCTACAAGATGGGCTGGGAAATTGGCATC^ 

ACTGI 1 1' I GGTTAGCCATCTCTTGGTTGAAGAGTCAACCTCTCCAGACCrrCTAGTCAATGAA 

TATTGGTACAGG ATTTGCCTTG C 1 1G1 1 AATCTCTATATGCCTTCACGAGAAGAGGAAATCCAGCACTACCACACG 

CTGGTGGAAGAAAAGTTAAAAGATATCCTCCAGCGCTTCAAATACTA 

GCACAGCTGGTAGCAGAATTAGACACGCTTTTGAAAGAAGCCCTCAGACTGGTCTATTTOT 

TCTTTCACCAGAGAGACTACCATATCCACTACTTTGAGATGAGACAGCGACAAAGTCG TATCC TGAGAAACATGG 

CCCAACAGATTAACACTTGTCACCTTGCCGCCAGTGAAAGCCrGATCTTAGCGCAACTCTTT^ 

TCAACTGAGCCAGACCAATCCTGCTTCTGATTTGCTAGATGAAATTG 

AGTCTGCCCAAGACAAGAGAAGAATTTGAAACCCGCGCC^CCCTTCTTCAACrCCTACGT 

ATCCAAGT AAAAGTTG A ITl'l'l ACCAAAAATATAG ACAGT AA 

4158.6 

atggaaatcatgtcgcttgcgattgct o 11111 g ccgtcatcattggttt agtcatitx} at atgtcagcatctcagc 

taagatgaaatcatctcaggaagctgcagagttcatgcttttaaatgctga^ 

acaagctgagcgtgaagcggattta lt1 g 1 1 aatgaagccaaacgtgaaagcaagtctcttaaaaaagaagcact 

atttkjaggccaaagaagaagccagaaaataccgtgaagaagtggacgctgaattcaaatcagaacct 

tcaaacaaatcgaaagtcgtttgacagagagagctactagccttgaccgtaaggacgacaatttc 

aacaaacaotgaacaaaaagaacaaagtattrctgatagagcgaaaaaccttgat^ 

aggaagtcgaaagacaaaaagaagcagaactagagcgtattggtgcgctgtct^ 

atcttggctcaoacagaggaaaacttgaccagggagattgccagtcgcattcgcgaagctgagcaagagg 

g aacg t t ci u acaaaatggccaagg acatcttggttcaag ct atg c aacgtatcgctggtg aat atgt agcgg ag 

tcaacaaactcaacagtrcatctgccagacgatactatgaagggacgcatratrggtcgtgaaggtcgtaacatt 

cgtacctttgaaagtttcacaggggtcgatgtgattatcgacgatacaccagaagtg 

atccgattcgtcgtgagattgcccgtatgactatggaaatgttgctcaaagatggtcgtatacatccagctcgtat 

cgaagagttggttcagaaaaaccgtcaagagarrgacaataagattcgtcaatacggtgaggctgct 

aattggtgcgccaaaccttcatccagacttgatgaagattatgggacgtttgcagttc^ 

aatgttttgcgccattcgattgaggttgctaagttggctggtatcatggcgaccgaacttc 
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TTGCCCCrrCCTGCTGGATTCCTTCACGATATCGGG 

CGGTATGGAATTGGCCCGTAAGTACAAGG AACCCCCA U i'lti I GGTGAATACGATTGCTAGTCACCACGGAGATGT 
TGAAGCTGAGAGCGTGATAGCAGTTATCGTCGCTGG\GCAGATGCCTTGAGCGCAGCCCGTCCAGGTGCTCGTAG 
TGACTCTCTTGAAAGCTACATCAAGCGTCTCCATG 

AGCTTTGCCCTTCAACCAGGACGTGAAATTCGTATCATGGTCAATCCAGGAAAAATCA^ 

ATCTTCGCTCACAAAGTTCGTAAGAAAATTGAAAACAATCTCGATTA 

CGCGAGCTTCGTGCAGTAGATTATGCTAAATAA 

4158.7 

ATGATGTTAAAACCCTCTATTGATACCTTGCTCGACAAGG 1 1 (JLTl CAAAATATTCACTCGTAATCTTGGAAGCAA 
AACGTGCCCACGAATTCGAAGCAG<nT;CCCCAG<^ 

TAGAAGAAATCGAATCAGGAAACGTTACAATTCACCCAGATCCAGAAGGAAAACGTO 

TCGAAGAjVGAAAAACGCCGCAVUVCAAGAAGAAGAAAAGAAAATCAA^ 

TGGTGAAAAAATTTAA 

4161.1 

ATGTCAGCATATCAATTACCGACCGTATGGCAGGATGAAGCTAGTAATCAAGGAGCl M I ACGGGGCTAAACAGA 
CCAACAGCAGGTGCCCGTTTCG AACAAAACTTGCCAAAAGGAGAACAAGC' 1 1 1 1 1 LAGCTTTATTCACTGGG AACA 
CCAAATGGTGTGAAGGTTACTATCTTATTXjGAAGAATTACrAGAAGCT 
ATAAGATTGCTATCATGGATGGGGATCAATTCGGATCAGACTTTGTGAAGCTCAATCC^ 

CTTATTGG ACCAGTCAGGTACTGAAAACGTAAG AGTCTTTGAGTCTGCrCATATTCl 1 1 C 1 1 1 ACCTTGCTGAG AAA 

TTTGGAGC Cn 1 11 ACCAAGTAATCCTGTGG AAAAGGTAGAACT 1' l'I ' 1 G AATTGGCTATTCTGGCAAGCAGGTGCAG 

CACC 1' IT 1 CTAGGTGGGGG ATTTGG ACA IT I C l'I CAATTATGCTCCTGAAAAATTGGAATATCCTATTAACCG 1 T I T 

ACGATGGAAGTGAAACGCCAGTTGGATTTATTGGATAAGGAATTGGCTCAGAAACCTTATATT^ 

TATACGATTGCAGATATTGCTATCTGGTCTTGGTATGGACAGTTAGTTGVAGGAAATCTTTACCA^ 

AATTCTTGGATGCCTCAAGTTATCAAAATCTAGTAAAATGGGCAGAAAAAATTG 

GTGCCTTGGAAGTAACTTATACAGAAATTAAATAG 

4161.2 

TTGGCAAGCTTGATCACTTCTATCATCATGTTCTATO 1 CUU ! ' 1 1 1XJG ATGTTCTAAG AG ATACCATTCAAAAGATTCT 
CAGTCGGGAAGAAACGGTCATTXjATCCTCTTGGTGCAACTCTAGGAATCA 11 ' 1 L 1 O CAGCGATTA l X i 1 T1X J 1 G GTC 
TATCTCTACAATACTCGCCTCAGT^ 

A(XCTGTTACCTCACTTGGAACCGGCATTGCCATCCTAGCTAGTAGTTTG 

TGCTATCATCATCA Cl 1 1CU t ATCTTGAAGACTGCCTATGATATCTTCATCGA CjI ' CTl ' LCl 1 T AGTCTTTCAG ATG 

GCmGACGACCGCCTCCTCGAGGACrACCAAAAGGCTATCATGGAAATTCCCAAAATCAG 

AAAGAGGTCGCACCTACGGTAGCAACATCTACCTGGATATTACACTAGAGATGAATCCTGACTTGTCTU ITl 1.1 GA 

AAGCCATGAAATCGCGGATCAGCTCGAGTCTATGCTGGAGGAGCGTTTTGGCGTCTTTCATACCG 

GAACCAGCACCTATCCCTGAGGATGAAATTTTAGACAATGTCTATAAAAAATTGCTTATGCGTC 

ACCAAGGAAACCAACTAGAAGAACTCTTGACTGATGATTTTCTCTATATTCGCCAAGATGGA 

AAGAGGCTTATAAGACCAAAAAAGAGTTAAATTCTGCTATCAAGGACATTCAAATTACr^ 

CCAAACTCATCTGCTATGAGTTAGATGGTATCATCCATACCAGTATCTGGCGTCGCCACGAAACCTGGCA 

CTTTCATCAAGAAACCAAAAAAGAATAG 

4162.1 

ATGACAATTAAACTAGTAGCAACGGATATXjGACGGAACCTTCCTAGATGGGAATGGACGCTTTGATAT 
CTCAAGTCTCTCTTGGTTTCCTACAAGGAAAAAGGGArTTACTTTGCGOT 

TAG AAAAATTA 1 ' 1' iX jC ll X i IXil' l CGTGATG ACATTATTTTGVTCGCGG AAAATGGCAGTTTGGTAGAGTATCAAGG 

TCAGG AC l'I U 1 ATG AAGCG ACT ATGTCTCGTG ACTTTT ATCTGGCAAC 1 ' 1 1 1 0 AAAAGCTGAAAACTTCACCTTAT 

GTAGATATCAATAAACTCCTCTTGACGGGTAAGAAGGGFrCATATGTTCTAGATACG 

AACntjAGTCAGCACTATAATGAAAATATCCAAAAAGTAGCGAGTrrcGAAGATATCACAGATC 

TTAO^CCAACTTCACAGAAGAAACGCTGGAAGATGGGGAGGCTTXWGTAAACG 

CCATGACAACTGGCITTC^TCCATTCATATTGTrCTGGACT^ 

TAAAAAACTTGGTATCACAATGGATCAGGTCATGG C nil U G AG ACAATCTT AATG ACTTACATATG ATGCAGGTT 

CTTCGGACATCCTGTAGCTCCTOAAAATGCACGACCTGAAAT^ 

AGGAACGOTC G GTTATAGCTTATATGGAGGGCTTATAA 

4162.2 

ATGCCAGATATAAAATTGATTGCATrGGACTTGGACGGGACCTTGCTGACTACTGATA 
ACCAAGGAAACCTTGCAAGCTGCGCGTGATCGTGGTATCAAG 

ATGGA l' l'I C IT IVTCCATGAGrTAGGGACTG ACGGTCAGG AAGATG AGTATACCATTAC 1 1 1 1 AATGGTGG ATTAG 
TTCAGAAAAATACAGGAGAAATCCTTG ATAAAACAGT C 1 1 1 1 CATATGATGATtrrGGCACGTTTGTATGAAGAAAC 
AGAGAAATTATCACTGCCTCTTGATGCCATCTCAGAAGGAACAGTTTATCAAATCCAATCGGACCAAGAAAOT 
TTATGCCAAATTCAATCCAGCTTTG ACC1" I' I'U' H'CCAGTCG ACTTTG AAGACTTATCTAGTCAAATG ACCTACAAC 
AAATGCGTGACTGCCTTTGCTCAAGAACCCTTGGATGCAGCCATTCAGAAGA l 'I rCT C CAGAATTG TTTGA CCAAT 
ATGAAATCTrTAAATCACGTGAAATGTTGCTAGAATGGTCACOLAAGAATGTTCAT 
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AACTAATCAGCCATCTTGGAATCGACCAAAGTCAAGTGATGCCTrCTCGTGACGAGGCCAATGACCT 
TTGAATGGGCAGGTCTTGGTGTrGCTATGCAAAACG 

CGATGACCAACGATGAGGAAGCTGTCGCCTGGGCTATCGAAGAATATGTGCTAAAGGAGAACTAA 
5 4164.2 

ATGGAAAGTTTACTTATTCTATTATTAATTGCCAATCTAGCrcGTCTCTT^ 

AGG AG AAACACTTAAGTAAG AGCTTGG AGG ATCAGGCAG ATCA i ' l ' lli 1 CAGACCAGTTGGATTACCGCTTTGACC 
AAGCCAGAGVAGCCAGCCAGTTAGACCAAAAAGATTTOGAAG [XjUVIXj ICAGCGACCGTTTGCAAGAAGTGCGGA 
TTGAATTGCACCAAGGTCTGACCCAAGTCCGTCAAGAAATGACAGATAATCTCCTCCAAACTAGAGACAAGACAG 

10 ACCAACGTCTCCAAGCCTTGCAGG AATCAAATG AGCAAC O 111 O GAACAAATGCGCCAGACGGTCGAGGAAAAAC 

TAGAAAAGACCTTGG\GACACGCTTACAGGCTTCCTTTGAGACAGTTTCTAAACAACTGG 
CCTTX3GAGAAATGCAGACACTITGCCCCrrGATGTCGCAGCTC^ 
GATTCTGGGAGAATTGCAACTGGGGCAAATTATTGAAGACATCATGACACCT 
AACGGTTGAAAACTCTAGTGAACGAGTGGAGTATGCCATCAAGTTACCCGGACAAGGCGACCAAGAA 

1 5 TCTGCCAATTGACTCTAAGTTTCCACTGGCAGATTATTACCGCrTGGAAGAAGC 

GAGATTGAACGCTGTCGTAAGTCACrCCTAGCAAGCGTCAAGCGCnTGCTAGGGATATTAGGAA 
GCACCACCTCGGACGACCAATTTTGC A U I ' l ' I ' l U l ' 1 ' l (j i ' lXJ CGACAGAAOGTCTCTACTCAGAAATCGTCCGCAATC 
CGGT C I ' l G ATG ATTTGAGACGGG AAGAACAGATTA I' 1'G I IXjCAGG ACCAACTACCCTATCAGCCCTTCTTAA 
CTCCCTATCAGTTGGTTTCAAGACCrTTAATATCCAAAAGAGTGCCGACCATATCAG 

20 AAGACCGAGTTTGGCAAGTITGGTGCTATTCTGGTCAAGGCACAAAAACATCTC 

ATGAATTATTAAACCGTCGTACCATAGCTATCGAGCGGACGCTCCGTCACATTGAGTTGTCAGAAGGTGAGCCTC 
GCTTGATCTACTCCATTTTCAAGAAAATGAGGAAGAATATGAAGATTAG 



25 ATGAAGATTAGTCACATGAAAAAAGATGAGTTATTTGAAGGCTTTTACCTAATC\AATCAGCrGA 

ACTCGAGCTGGGAAAAACTACCTAGCCTTTACCTTCCAAGATGATAGTGGCGAGATTGATGGGAAGCTCTG 
GCCCAACCTCATAACATTGAGGCCTTTACCGCAGGTAAGGTTXrrCCACATGAAAGGACG 
AATACCCCTCAAGTCAATCAAATTACTCTCCGC(nX}CCTCAAGCTGGTC 
TCAAGTCACCAGTTGATGTCAAGGAAATTCGTGACTACATGTCGCAAATGA 

30 GCAACGGATTGTCCGAAATCTCTACACCAAGTATGATAAGGAATTCTACTCCTATCCAGCT 

CCATGCCTTTGAAACGGGCnTGGCCTATCATACXGCGACCATtjGTGOj 1T1 GGCAGACGCTATTAGCGAAGnTAT 
CCTC\GCTCAATAAGAGCCTGCTCTATGCGGGGATTATGTTGCATGACTTAGCT 

CAGACG^GACAGAGTACACAGTGCGAGGTAAT CITCriXJ GACATATCGCTCTCATTGATAGCGAAATTACCAAGA 
CAGTTATGG AACTCCGCATCGATGATACCAAGG AAG AAGT CU IT i ' lUC I ' l CGTCATGTCATCCTCAGTCACCACGG 
35 CnX3CTTGACrrATGGAAGCCCAGTCCGTCCACGCAITATC 

GCAAGCATGATGATGATGTCAACAGCTCTTGCTTTGGTGGATAAAGGAGAGATGACCAATAAAATCTTCGC^ 
GATAATCGTTCCTTCTATAAACCAGATTTAGATTAA 

4l66 - 2 

40 ATG AGTG AAAAAGCT AAAAAAGGGTTT AAG ATGCCTTCATCTT ACACCGT ATT ATTG AT AATCATTGCT ATTATGG 

CAGTGCTAACTTGGTTTATCCCTGCGGGGGCCTTTATAGAAGGTATTTACGAGACT 
GATTTGGGATGTCCTCATGGCACCGATTCGGGCTATGCTAGGTACTCATCCAGAGGAAGGTTCGC^ 
ACGAGCGCAGCGATTGATGTAGCCTTCTTCATCCTTATGGTTCGTG 
CTXTTTGACGTAGGGATTGCCTCTATCGTGAAGAAGTATAAGGGCCGCGAAAAAATGTT 

45 1110 IT 1G CCCTCCGTCCTACAACTTATGCTATGGGTGAAGAAA 

ATGATGGCCGTTG ti 1 11 1 G ATAGCCTG ACTGGTGTTGCAATT A n ' nXi CTCGGTTCTCAAATCGGCTGTTTGGCATC 
TACTCTCAATCCATTTGCGACAGGTATTGCTTCAGCGACnXjCGGGAGTTGGTACAGG 

CTOATCTTCTGGGTTACCTTGACTGCTCTTAGTA C 1 1 OU 1 ' I ' l O MT ACCGTTATGCGGATAAGATTCAAAAAG ATCC 
GACTAAGTCACTGGTTTATAGTACTCGCAAAGAAGATTTGAAACACTTTAAOT 
50 TCTACACTTAGCAGCAAACAAAAATCAGTTCTL' 1 1 U ri'ATTTGTGTTGACATTCATCTTGATGGTATTGAGCTTCAT 

TCCATGGACAGACCTnjGCOTTACXATTTTTGATGACTTTAATACTTG 

TTGTCGGTTCATCTA C i'iUl UCACTAGGTACTTGGT ACTTCCCAG AAGGCGCAATG C IV I'l ' 1 G CCTTT ATGGGTATC 

CTG A 1 " 1"GG rGT" I" ATTTATGO" ITT I' AAAG AAG ATAAG ATTATCTCTTCCTTCATG AATGGTGCTtjCTG ACTTGCTCAG 

T G TT GC C 1 1 G ATCGTAGCGATTGCTCGTGGTATTCAAGTTATCATGAACGACGGTATGATTACCGATACAATCCTC 
55 AACTGGGGTAAAGAAGGCTTGAGCGGTCTATCTTCACAAGTCTTTATCGT^ 

GTCA1 1 C 1 1 OATCCCATCTTCATCTGGTCTTGCCAGCGCAACTATGGGTATCATGGC 

TCCACTTGGAGAATTTGTAAATGTCCGTCCTAGCTTGATTATCACTGCrrACCAA 

TGATrcCACCAACATCTGGTATTGTGATGGGAGCTCTrGCACTTGGACGTATC^ 

CATGGGCAAACTCGTAGTCGCTATTATTGTAGTGACCATCGCCCTTCTTCTCC 
60 AA 

4166.3 

ATGAAAATAGATATAACAAATCAAGTTAAAGATGAATTTCTTATATCATTAAAAACCTTGATTT CCT 
TACTCAATGAAGGAGAAAATGGAACACCTTTTGGACAAGCAATCCAAGATGTCCTAGAAAAAA 
65 GTCGAGACATAGGTTTCACTACCTATCTTGACCCTAAAGGTTATTACGGATATGCAGAAATCGGTCAGGGAGCAG 
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agcttctggccattctctctcatttggatgttgtt 

agcaactatcaaagacgcctgggtattcggacgtggtgtccaagatgataaaggccot 

tgcagtaaaaagcttgctggaccaaggtattcagt^ 

aaccctctggcgctgcatggcacgctacaataccatcg 

ttttcctctcacctatgctgaaaaagggcttctacaggtcaaacitcatg 

gaagtaggaggcgcctttaacgttcrraccagacaaggccaactaccaaggtctcclct 

ggtctcaaagaagctggttatgattaccaaaccactgaacaaaccgtaacggtr 

aa ggatgctag tcaaggtatcaatgctgtcatccgactagctaccattcttgctcc^ 

tcagl ii ll'l'l gcaacacaagcaggtcaagacxjgcacaggaagacaaatctttggtgatatagcagatc 

ctggtcacctatccnttaatgtcgcaggtctcatgatcaatca 

tcc tgtct tagctcacaaggaagaactagtagagtrgcitacaagatgtgcacaaaactac 

agagtttgactatctagcgcctctatacgtcgcagaagacagtaaactcgttagcacacn}atgo 

AGAAAAG ACTGGCGATAACAGTCCTGCTATTTCATCCGCTGGTGCCA CI ' I ' I ' IbC ' IXXj CACCATGCCAAATTGTGTA 

GCCTTCGGCGCCTTATTCCCAGGAGCGAAGCAGACAGAACATC^GGCAAATGAATGTGC 

TACCGTGCTATGGATATTTATGCCGAAGCCCTCTATCGACTTCCAACTTAA 

4169.1 

ATGTCTAATTCATTTGTCAAGTTGTT AGTCTCTCAATT ATTTG CAAATTTAGCAGAT A 1 ' 1 ' M LT I ' l AG AGTAACAAT 

CATTGCTAAC ATATAC ATTATTTCAAAATCAGTAATTGCGVCATCACTAGTTCCTATC^ 

TTGTTGCGAGTCrTTTAGTTCCGTTGGTTACTAAAAGGTTAGCGCTAAATAGGGTTTTAT^ 

AAGACT ATATTATTGGCGATACTCGTA GGAA TCTTTACCGTAATGCAATCCG 

TTtnTGTTGCAATTTCCATACTAGATGGTTTTGCAGCACCCGTTTCCTATC 

T TGGG TAAGGCTAATrCAGCCTTATCAATGACrGGTGAAGCTGTTCAATO 

TGTTTGCAACAATTGGTCTXjTTACCTACCACGTGTATCAATTTAGTCT^ 

tttcttcctaacgctcaagtggaggtgttagagtcagaaact 

ttgctagaaat cctagat taagactttttgtatcagcaaatttattggaaattt^ 

tccattatac i ' i o' i'l ' l'l ' i u i'aacgg agttatt aaat aaaacggaaj\gttactgggg atattct aat acagcatact 

ctattggtattataattagtggcttaa i ' 1 uc l ' l ' l ' 1 'aggctatctgaaaa g i rcltl uc t gctaaatgggaacccca 

atta ttcacccca aatctaaaaaccatccagaatccttgccttagcitagatccrgg atg o i ' 1 1 c i ' l ' l ' la ccca 

atggo iii i'l 1 ' 1' !m ac tag ac aaaaaag agtttcccctttatggtataagtgtagaaaaaaacacaaaaag aaagg 

aaactcacatgaagagtttaccaaatcatcacttccaa 

tttaacccagtatggtggtcttatcttttttcaggaal 1 1 1 1 ricccagttgaaactaaaagagcggatttctaagt 

atttagtaacg aatgaccaacgccgctactgtcgttattcgg attcagata ixx ' 1 ' 1 gtccagttcctctttcaactg 

ttaacaggttatggaacggactatgcttgtaaagaattgtcagctgatgcctactt 

tccaaaattg ttgg aaggagggcagcnxjcttcacagccaaccttatcccg l'l'l i l u 1 ccag aactgacg agg aa 

acagtccatagtttgcgatgcctcaaccttg aattggtcgaattl 1 1 1 ri'acagtttcaccagctaaaccaactca 

ttgtagatatcgattctacccatttcacmcttatggcaagcaagaago'ltj^r 

tcatggctatcatcctctttatgctttcgaggggaagacaggttattgtrtcaatgcccag 

gttatt gttct g aag aggcagacagctttatcacacct g ' l ' l ' l " ! ' agaacggtttaatcaacttctctttcgaatgga 

tagtggctttgcgaccccaaaattatacgatttaattgaaaaaacack3g 

aaatactgttctgagccgtcttggagacctttccctcccttgcccacaggatgaggac^ 

tccgcctactx^gaaactctctatcaagcaggatcnggtcgcacaagcgtcgtc 

AAOAAGGAAACTTGTTCTACGATCrrrATTrCTCTCGTTACAAATA 

GCTTTATCGTGGACGTGGTCAAGCCGAGAATTTCATCAAGGAGATGAAGGAGGGA l 1111 1 G GCG AT AAAACGGA 
TAGTTCAACCTTAATCAAAAACGAAGTTCGTATGATGATGAG CTGT ATCGCCTACAATCTCTAT C 1 l'l'l 1 CTCAAA 
CATCTAGCTGGAGGTGACTTCCAAACTTTAACAATCAAACGCTTCCGCCAT C 1 U UU 1 C ACGTGGTGCGAAAAT 
GTGTTCG AACAGGACGCAAGCAGCTCCTCAAATTGTCTAGTCTCTATGCCrATTCCGA A I ' lTJ ' 1 ' I ' lT CACCACTTTA 
TTCTAGGATTAGAAAAGTCAACCTGAATCTTCCTCTrTCaTATGAACCACCT 
CATTAA 

4169.3 

AT GATGG AG I'l'I'lT 1X1AACAGCTTCCTCATTT AG AGCCAT ATGGCAATCCTCAGTA ITl ' lG I ' ll ATGTOATTGCTGC 

AA^CTTGCCCATCTTTATAGGTCTCTTTTTC\AGAAAC 

TTGT CACCA TCnTGGTGGGTGGAAAGACCAATCAACTAGCTGCCT^ 

CC 1 UC IT I'lVl' ACAA GCATTATCGAAAAAGCAAGGATGGCAAGTGGGTCTTCTACTTAGTTAU'l'ri'l CTGTCCCTA 
CTrCCG ATTA TCTTTGTCAAGGTGCAACCAGCTATCAATGG AACGCAGT L ' 11 ' lliU 1 GC U ' I ' ll ' l ' I CGGAA I ' l 1C1 1 A 
CCTGACCTTTCGTTCGCrrTGGAATTGTCATCCAGCTG 
CTCCGTTTCUriVlVri'CATGCCAACTTTCTCGAGTGGTCCAATC^ 

GO CTAT TCCTGAGCGAO ATGAGT rGATGGATATGCTGG ATGAATCrGTCCGCTATATCATGTGGGG Cl ' 1 ' 1 ' 1 ' 1 U 1 AT 

AAGTTT ATCCTAGCTCA 1 G 1 1 1 1 AGG AGAG ACCTTACTACCTCCTCTGAAG AATTTAGCCTTGCAGTCAGGTGGCT 

TCT ITAA TCICTATGCCTTGGCAGTrATGT ATACTTTTGGTCTGG AA L ' 1 ' Cr 1 ' L ' l ' l ' ! G ACTTTGCAGGTTATTCTATGT 

TTGCTTTCGCCATCTCAAACTTGATGGGAATCCGTAGCCCT 

AAAGGAGTTTTGGAATCGCTGGCATATGAGTCTCTCCTTCTGGTTCCGTCACITrCT 

GTGTTAACCAGAAAGAAAGTCTTTAAAAATCGTAATGTAACCTCAAGCATGGCCT 

TGGGATTTTGGCATGGTGTGACCTGCrTACTATATCGCCTATGGACTaTTCATGG 
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TGCCTCGGTTCGCAAGAAAAAAACGCT 

GCTGG ATTC AGTTG CTTGG CATGGTTGTCACTTrCCATGTTGTC ATGTTGTCATTCTTAATC 1 ' 1' 1' 1 L' 1 GG ATTCTTG A 
ATAATCTATGGTTTAAAAAATAA 

4169.4 

ATGCTTAAACGCTTATGGATGATCTTCGGACCGGTCTTGATCGCTGGTTTG^ 

CCTACTGAGATGCATCATAATCTAGGACCTGAAAAGCGTTCAG(>GTGGCTACTACTATCGATAGTm 

GAAGTCAAAAAGTCAG AGCACTATCTGATCCAAATGTGCG 1 1 I \ G ntJ CCTTCTTTGGCTCTAGTGAATGGCTTCG 

TTTTGACGCTGCTCATCCTGCG CTATTA GCTGAGAAATACAATCCTrCCTACCXir 

GGAGCTGCATCGCTTAACCAATATTTTGGAATGCAACAGATCTTACCACAGCTC 

GTTATC TCACC TCAGTGGTTCAGTAAAAATGGCTATGATCCAGCAGCCTTCCAGCAGTATT^ 

TGACTAGTTTTCTGAAACATCAATCTGGGGATCAGGCTAGTCAATATGCAGCGACTCG 

AAACGTAGCTATGAAGGACCTGGTTCAGAAGTTGGCAACTAAAGAAGAATTCT 

TGAATTATTGGCTCGTTTTAATGAACGCCAAGCTTCCTTTTTTGGTC^ 

ATAAGCATGTAGCTAAGTATTTAAAAATCTTGCCAGACCAG1 1 ! I'Cl" I'ATCAGGCAATAG AAG ATGTTGTCAAAGC 

AGATGCTGAAAAAAATAmCCAATAATGAGATGGGAATGGAAAATTATTTCTATAATGAGCAGATCAAGAA^ 

TTTGAAGAAATTAAAGGATTCTCAGAAAAGCTTTACCTATCTCAAGTCGCCAGAOT 

TTAACACAGTTTTCTAAATCTAAGGTAAACCCGATTTTTATCATTCCACCT 

CTCGTCTACGAGAGGATATGTACCAACAAACGGTGCAGAAGATTCGCT 

ATATA GCAGA TTTTTCTAAGG ACGGC GGGG AGCL 1 1 1 CI' I ' I ATG AAGGACACCATTCACCTTGGTTGGTTGGGTTG 
GTTGG C I I'l' 1 G ACAAGGCAGTTGATCCTTTCCTATCCAATCCCACACCAGCTCCGACTTACCA 
TTTTTCAGCAAAGATTGGGCGACTTATGATGGAGATGTCAAAGAATTTCAATAG 

4169.6 

ATGGAGAAAAACCTCAAGGCTTTGAAACAAACAACAGACCAAGAAGGCCCAGCAATTGAACCTGAA^ 
GGATACCAAGAC AGTCC AA AATGG TTACTTCGAGGATGCAGCTGTCAAGGACCGG\CCTTCAGTGACT 

taacnxjgcaatcagtttatcctttccttgaagacggcacgtttgaccaact 

ggtaagatgacccaggc tgagtac aaggcttactatacaaaaggctatcatacagatgtgactaagattaaca^ 

actgataatactatggaatttcttcaagcrrcgacaa^ 

ttgacttacaagaaaggcaatcgtggcgtgcgtttcctctttgaagccacagatc 

ATXjTTCAGT TTAGT C ACCACAATGTTGCCCCA 1 ' I Cl'l'I GGAGGCACAAGCCA 

AGAAGCCCTCTTTGAAGAAATGGACAACTGGCG\ACCTACTACCCAGATAACCrATCTGGCCAAGAAATCGCCCA 
AGAAATGTTGGCGCATTGA 

4170J 

ATGAAAGATG GTCAT TTGCTAGCCCATCATATTCG 111 G 1TGAATGGGCGGATTTTTCAAAAGTTACTGAGTCAAG 

ATCCTGAGGCTCTTTATAGGGGTGAACAGGGCAAGATTTTAGCGGTTTTATGGAATAGTGAAACTGG 

CTGCGACAGATATCGCGCTTGCGACTGCACTTGCGAATAATACGCTGACGACTATGATAAAAAAGCTAGAGGAAC 

AAAAGCTTGTAATTGTTAGTCCCrrGTGGAAAAGACAAGCGTAAGAAGTATTrAGTT^ 

CCCAGAAAGAACTG GGGCA TCGTGTCAGTCAGAAATTGGATACTATCITTTACAAAGGATTTT 

TTCACCAATTTGAAGGTmCAAGAAAGAATTTTGGCGAATCTGAAAGAG^ 

4170.4 

ATGACTAATTTAATTGCAACTmCAGGATCGTmAGTGATTGGTTGACAGCT 

GCnTrTGACCTTGTTACTAGCTAT TTTGC TTGCGATTCCCTTGGCTG i 1 1 1 1C1 1 CGCTATCATGAGAAGCTGGCCG 
ACTGGGTCTTGCAG ATTGC AGGT A I'l 'l'l CCAG ACCATCCCCTCTCTGGC CI ' I ' GT i 'GGGGCrCTTTATCC L ' 1 ' i ' Iti ATG 
GGAATTGGGACCTTGCCGGCTTTGACAGCTCTAGTGATTTATGCGATTTTCCCTATTTTGC 
GGCTGAAGGGAATTGATCCGAACCTGCAAGAGGCTGGGATTGCCTTTGGGATGACCAGAT^ 
AATTTGAAATTCCACTCGCCATGCCTGTTATCATGTCTGGG ATTCGGACGGCAGCT ^ 
AACC THSG CGGCCTTGAT TGGT GCAGGGGG ACT AGGTTCCTTr ATT C ^ 

TTQATTTTG ATTGGC OCAC 1 1 ICI 1 CTGCA GTGCTAGCCATTGCCTTTAACTTCCTACTAAAAGTG ATGG AAAAAG 

CAAAATTACGGACGATTTTCTCAGGTTTTGCCTTGGTCGCTTTAT^ 

GTTCAAAAAGAGAAGGAAAACTTGGTTATTGCTC 

AAGTTGCTGATTGAAGAAAATACCAGCATGACTGCGACTGTTAAACCGAATT^ 

AAGCTCTGAAAAAAGGCGATATTGACAT CTAT CCTGAAriTACTGCT 

ACOCAAGGTGAGTCATGAACCAGAACAGGTTTATCAGGTX3GCGCGTGATGGCATTC 

CTATCTCAAACCCATGTCTTATCAAAACACCTATGCTGTAGCTGTTCCG AAAAAG 

AAGACCATTTCAGACTTGAAAAAAGTCGAAGGGCAGTTGAAGGCAGGTITTACACTCG 

GATGGAAATAAGGGCTrGCAATCAATGTATGGTCTCAATCTCAATCT^ 

AGGCT ATTCAGTCAGGGGATATTCAAATCACGGATGCCTATTCGACTGATXjCGGAATTG 

GGTCTTCKiAAGATGACAAGCAACTCTrcCCACCTTATCAAGGGGCTCCACT 

ACACCCAGAGTTGGAAAGAGTTCTTAATACATrGGCTGGTAAGATTACAGAAAGCCAGATGAGCCAGCTCAACTA 

CCAAGTCGGTGTTGAAGGCAAGTCAGCAAAGCAAGTAGCCAAGGAGTTTCTCCAAGAACAAGGTTTGT^ 

ATGA 
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4170.5 

ATCATGCATACTTATTTGCAAAAGAAAATTCAAAATATCAAAACAACCCTAGGTGAAATCT 

CGTATCGTTCCGGCTATGGCTGATTTAGGATTTTCAGGAACT 

GTAG T I'l 1 1 G CCC^GTGG ATTT ATTTGCTGGTm AGG AAGTTTTCCTCrCTC 

ATTGTTGACTGGATTGGGATGATTTGTAGCITGACAGGGATTAT 

CTrAATTATCTTTTTGGCTTGATTAACTCTGTTAmAC l'1'l ATGCTGAGGTG 

CTGACGACACTTTACTTCACAGTCATGCAGCCAATTGGACrrCTAGTTTGGATTTATCAGGCA^ 
AAAAGCAGGAGTTTGTCGCGCGTAAACTGGACGGCAAGGGCTGGACAAAGTATCTTTCCATTACT 
GGTTGGCCTTTGGCTTCATTTATCAGTCTATTGGTGCCAATCGTCCCT 

TGGGGTAGGGCAAATCCTCATG ACAGCTGTTTACCGTG AACAGTGG ATATTCTGGGCGGCTACCAATG T C 1 T 1 ' 1 'CA 

ATCTATCTCTGGTGGGGAGAAAGCCTGCAAATTCAAGGGAAATATCTAATTTATCrCATTAA 

GGTATCAATGGAGCAAGGCAGCTAAGCAGAATACTGATTTACTTAACTAG 

4170.6 

ATGAG AAATATG AAGGCAAAATATGCTGTTTGGGTGG L ' 1 ' 1 ' 1 ' IT I C1T AAATTTG ACTTATGCCATTGTTG AGTTTAT 
TGCAGCT GG AGT ATTTGGTTCTAGCGCTG' 1'1'C V I'GCTG ACTCTGTGCATGACTTGGG AGATGCG ATTGCAATTGGA 
ATATCAGCl 1 1 ' I CTAGAAACAATCTCCAATCGTG AAG AAGACAATCAGTACACCTTGGGCTATAAGCGGTTTAGCC 
TGCTAGGAGCCTTGGTAACAGCTGTGATTCTCGTAACGGCCTCTGTTCTAGTCATTTC 
TTTGCATCCGCAACCAGTCAATGATGAGGGGATTCTCTGGTTAGGAATTATTGCGATTACTATCAATCT 
AGTCTGGTGGTTGGTAAGGGAAAGACAAAGAATGAGTCTATTCTGAGTCTGCAT^ 

TGGGTAGCTGTTATCCTG ATGGCGATTGTTtnTCGATTTACGGACTGGTATATCCTAGATCCT C 1 1 1 1G1 CCCITGT 
CA rriCriUCriU 'ATTCTTTCAAAAGCCCTTCCA GjrrnT GGTCTACACTCAAGA l ' l 1 1C1 1 GGATGCTGTGCCAG 
AAGGTCTTGATATCAAGCAAGTAAAGAGTGGCCTGGAGCGATTGGAGVATGTGGCCAGCCTTAATCAGCTTAA 
TCTGGACTATGGATGCT TTGG AAAAAAATGCCATTGTCCATGTTTGTCTAAAAGAAATG^ 
TAAAGACrTCTATTCGAATTTTCCTAAAAGATTGTGGTTTTCAAAAT^ 

ACTCACCAAACCCATAAGCGAAAGGTGTGTGACITGGAACGGAGTTATGAGCATCAACATTAG 
4170.8 

ATGATTGAATACAAAAATGTAGCACTGCGCTACACAGAAAACKiATGTCTTGAG^ 
GATGGGGAATTTATGGTTTTAGTAGGCCCTTCTGGCTCAGGTAAGACGACCATG 

TGGAACCAACT GATGG AAATATTTATAT GGATG GCAAGCGCATCAAAGACTATGATGAGCGTGAACTTCGTCTTT 
CTACTGGTTATU ITl'l ACAGGCTATTGCTCT IT I I CCAAATCTAACAGTTGCGG AAAATATTGCTCTCATTCCTGAA 
ATGAAGGGGTGG AGCAAGG AAG AAATTACG AAG AAAACAG AAG AG CTTTTGGCTAAGG 1 1 OG 1 ' f T ACCAGTAGCC 
GAGTATGGGCATCGCTTACCTAGTGAATTATCTGGTGGAGAACAGCAACGGGTCGGTATTCT 
GTCAGCCCAACATTTTCCTCATGGATGAACC U 1 1 1 CGGCCTTGGATGCTATTTCGAGAAAACAGTTGCAGGTTCT 
GACAAAAGAATTGCATAAAGAGTTTGGGATGACAACG A 1 11 1 1 0 1 AACCCATG ATACGGATGAAGCCTTGAAGTT 
GGCGGACCGTATTGCTGTCTTGCAGGATGGAGAAATTCGCCAGGTAGCGAATCCCGAGACAATTTTAAA^ 
TGCAACAG ACTTTGTAGCAGAC i'l Cj 1 II GGAGGTAGTGTTCATGACTAA 

4171.1 

ATGTCAGCAGTTGCTATTTCAGCTATGACCAAGGTTATGCAAGAAACCCACGGAAATCCTTCT 
ATGGTCGTGAAGCTGGCAAACTCTTGCGAGAAGCCCGTCAGGAACTAGCCCAGTTACT^ 
ATATCTTTTTCACTTCTGGTGGGACIT3AAGGCAATAATACTACCATCA 
AGGAAAACATATCATCACAACTOCCATCGAGCACCATGCTCT^ 

GGGTTTGAAGCAACCATTATCCAGCCAGAAAATCAAGAAATCACAGCCCAGCAAATTCAAAAGGCTTTACCT 
GATACGATTTTGGTTrCTACCATGTTTGTCAATAATGAGACAGGAAACCTACTGCCCATCGCTGAAA 
TACTCAAGCAACACCCTGCTGOTATCATGTTGATCCAGTTCAOT 
ATTGGGCATTGATTTTCTCACl' U CriVll i CCCACAAATTCCATGGTCCTAAGGGAA 

gcatggactttgattcctatctacatggcggagaccaggaacagaaaaaacgtc 
ccattgtaggcatggttck:agccctaaaagaagacctagaaaaacaagaagaacattttca^ 
tagaaactgcc 1 1 ricu ugcag agctggagggcattcagtattacctgaatagaggaaaacatcatcrcccttatgt 
tctcaatattggatttcctggtcagaaaaatgacctcttactccttc^ 

ctggctcagcctgtactgcagg cu '1'i'g i'ccaatccagccat g ' n ' cn 'gaagccatgtatggcgcaaattcagaacg 

CTTG AAGG AATCCCTTCGCATCA G I I'IIiI C GCCACAAAATACCGTTGAAGACCTACAAACCCTCGCAAAAACCTTA 
AAAGAAATTATCGGAGGTTAG 



4172.1 

ATGTTATTCAAATTATCTAj\GGAAAAAATAGAGCTAGGCTTATCTCGTTTATCG 

GTTTTGCCTrGGTCATTTTACTAGGCT CI ' C I ' l CI I ' l l GAGCTTGCCCTTTGTCCAAGTTGAAAGCTCACGAGCGACT 
TATTTTGATCAT C I 11 1 CACTGCTGTCTCTGCAGTCTGTGTGACGGGTCTCTCAACCCTTCCAG TAGC TCACA CCTA 
TAATATCTGGGGTCAAATAATCT G ' IT I G CTCTTG ATTCAG ATCGGTGGTCTAGGGCTC ATG AC CTTTATTGGGGTTT 
TCTATATCCAGAGCAAGG\AAAGCTTAGTCTTCGTAGCCGTGCAj\CTATTCAGGATAGTTTTAGTTATC 
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TCGATCTTTGAGAAAGTTTGTCTATTCTATTTTTCTCACGACC 1 1 \ 1 1 G GTTGAGAGCTTGGG AGCTATTTTGCTTA 
GTnTCGCCTTATTCCTCAACrTGCCTGGGGACGTGGT C I I I lM 'AGTTCCATTTTTCTAGCGATCTCAGCCTTCTGT 
AATGCCGGTTTTGATAATTTAGGGAGCACCAGTTTA I \ 10C1T I T CAGACCGATTTACTGGTCAATCTGGTGATTGC 
AGGCTTGATTATTACAGGCGGCCTTGGTTTrATGGTCTGGTTTGATTTGG 
5 GGACGTCTGCACTTTCATACG AAGCTTGT ACT ATTATTG ACT AT AG G 1 ' I IUMV1 r ATTTGGAACAGCAACTACTCT 

Cl ' l ' lCr I GAGTGGAACAATGCTGGAACGATTGGGAATCTCCCTGTTGCCGATAAG O 1 I I I AGTTAG Cri ' l 1 1 1 CAA 
ACAGTG ACGATGCG AACAGCTGG Cl'l 1 1 CT ACG AT AG ATTAT ACTCAGGCTCATCCTGTG ACTCTTTTG ATTT ATA 
TCTTACAG ATGTTTCTAGGTGGGGC^CCTGG AGG AACAGCTGGGGG ACTCAAG ATTACG ACA'H' ri'I'I GTCCIC 1 1 
GGTCTTTGCACGAAGTGAGCTTCTAGGCTTGCCTCATGCCAA tXj ' 1 'I Xj CGAGACGAACGATCGCGCCGCGAACGGTT 
1 0 CAAAAATCCTTTAGTGTCTTT ATT ATC I'l l I IUATGAGU ICriUATAGGATTGATTCTCCTAGGGATAACAGCCAA 

AGGCAATCCTCCCTTTATCCACCTCGTATTTGAAACCATTTCAGCTC 
ACTCCTGACCTTGGGAAATTG GCTC TCAGTGTrATCATGCCACTTATXnTrATGGG 

ACGAATTGGTCCCTTGACCnii IT! G IHAGCTTGGCAGATTACCATCCAGAAAAGAAAGATATGATTCACTATATG 
AAAGCAG ATATTAGTATTGGTTAA 

15 

4172.2 

ATGTCAGATCGTACGATTGGAATTTTGGGCTTGGGAATTTrTGGGAGC 

ATATGAATATTATCGCTATTGATGACCACGCAGAGCGCATCAATCA Gl 1 1U ACCCA U1T1 1 GGCGCGTGGAGTG AT 
TGGTGACATCACAGATGAAGAATTATTGAGATCAGCAGGGATTGATACCrcCGATACCGTTGTAGT^ 

20 TGAAAATCTGGAGTCGAGTGTGCTTGCGGTTATGCACTGTAAGAGTTTGGGG 

AAAAGTCAGACCGCTAAGAAAGTGCTAGAAAAGATTGGAGCTGACTCGGTTATCTCGCCAGAGTATGAAATGGGG 
CAGTCTCTAGCA(^GACCATTCTTTrCCATAATAGTGTTGATGTCTTTCAGTTC 
AGATGAAAATTCCTCAGTCITGGGCAGGTCAAACrrCrGAGTAAATTAGACCrCCCrrc 
TTTTGGGTTTCCGAGAGCAGGAAAATTCCCCATTGGATGTTGAATTTGGACC\GATGAC 

25 CTATATTTTX3GCAGTCATCAAGKACCAGTATTTGGATACCCTAGTAGCATTGAATTCCT 

4172.3 

ATGAAGTTATTGTCTATCGCAA WW l^CCTATAATGCAGCAGCCTATCTTCATTACTGTGTGG AGTCGCTAG TG A 
TTGGTGGTGAGCAAGTrGGGATTTTCATTATCAATGACGGGTGTCAGGATCAGACTCAGG 
30 AGCTAGCAAGTATCCTAATATCGTTAGAGCCATCTATCAGGAAAATAAATGCCATGGCGGTGCGGTCAATCGTGG 
CTTGGTAGAGGCTTCTGGGCGCTATTTTAAAGTAGTTGACAGTGATGACTG 

ATTCITGAAACCTTXiCAGGAACTTGAGAGCAAAGGTCAAGAGGTGGATGTL'l UOI GACC AA i Tl I b i CTATG AAA 

aggaagggcagtctcgtaagaagagtatgagttacgattcagtcttgcctgttcggcaga rrrri ggctgggacca 

GGTCGG AAATTTCTCCAAAGGCCAGTATACCATG ATGCACTCGCTGATTTATCGGACAGA 1' 1" 1 0 1 1 GCGTGCTAGC 
35 CAGTTCTAA 

4172.4 

ATGAAATTCAATCCAAATCAAAGATATACTCGTTGGTCTATTCGCCGTCTCAGTGTCGGTGTTC 
TGGCTAGTGGCTTCrrTGTCCTAGTTGGTCAGCCAAGTTCrGTACGTGCCGATGGGCT 
40 AGTCTTACCTGAAGAGACATCGGGAACGAAAGAGGGTGACTTATCAGAAAAACCAGGAGACACCGTTCT 

AGCGAAACCTGAGGGCGTTACTGGAAATACGAATTCACTTCCGACACCTACAGAAAGAACTGAAGTGAGCGAGGA 
AACAAGCCCTTCTAGTCTGGATACACTTTTTGAAAAAGATGAAGAAGCTCAAAA 

CTTAAAAGAAACTGTAGATACAGCTGATGTGGATGGGACACAAGCAAGTCCAGCAGAAACTACTCCTtjAACAAG^ 
AAAAGGTGGAGTG AAAG AAAATACAAAAG ACAGCATCG ATGTTCC IGClGtl 1 ATCTTGAAAAAGCTG AAGGGAA 

45 AGGTCCTTTCACTGCCGGTGTAAACCAAGTAATTCCTTATGAACTATTCGCTGGTGAT G 
TTACTAAAAGCTTCGGATAATGCTCCTTGGTCTGACAATGGTACTGCTAAAAATC^ 
GATTAACAAAAGGGAAATACTTCTATGAAGTAGACnTAAATGGCAATACrGTrcOT 
TTGATCAACTTCGCGCTAATGGTACTCAAACTTATAAAGCTACTGTTAAAGTTTACGGAAATAAA 
TGACTTGACTAATCTAGTTGCTACTAAAAATGTAGACATC^CATCAATGGATTACTO 

50 AAAGCCGTTGCAGACAACGTTAAAGACAGTATCGATGTTCCAGCAGCCrACCrAGAAAAAGCCAAGGGTGAAG 
CCATTCACAGCAGGTGTCAACCATGTGATTCCATACGAACTCTTCGCAGGTC 

TGVAGGCATCTGACAAGGCACCATGGTCAGATAACGGCGACGCTAAAAACCCAGCCCTATCTCCACTAGGCGAAA 

ACGTGAAGACCAAAGGTCAATACTTCTATCAAGTAGCCTTGGACGGAAATGTAGCTGG 

TCATTCACCAGTTCCGAGCAAATGGTACTCAAACTTACAGCGCTACAGTCAATGTCTATGCT 

55 AACCAGACirGGACAACATCGTAGCAACTAAAAAAGTCACrATTAACATAAACGGT^ 
TTCAAAAAGCCGTTGCAGACAACGTrAAAGACAGTATCGATGTTCCAGCAGCCTACCT 
AAGGTCCATTCACAGCAGGTtjrCAACCATGTGATTCCATACGAACrCTTCGCAGGTGA^ 
CTTGCTCAAGCCATCTGACAAGGCACCATGCTrCAGATAACGGTGACGCT 
TGAAAACXnXjAAGACCAAAGGTCAATACTTCTATCAATTAGCCTTGGACGGAAATGTAGCTXXiC 

60 AGCGCrCATTOACCACTTCCGAGCAAACCGTACTCAAA 

CGGTAAACXAGACTTGGACAACATCGTAGCAACTAAAAAAGTCACTAT^ 
AACAGTTCAAAAAGCCGTTGCAGACAACGTTAAGACAGTATCGATGTTCCAGCAGCCTACCTAG 

4172.5 
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ATGAAACTAAAAAGTTATATTTTGGTTGG ATATATTATTTCAACCCTCTTAACC A 1 l l ltiOMOl I II 1 1G CCCTCT 
TCAAAAAATG CTGAT TGCGAAAGGCG AGATTTA C 1 1 1 1 1 GCTTGGGATGACCATCGTTCCCAGC C HOI ' CGGTGCT 
GGGATTAGTCTCTTTCTCCTATTGCCAGTCTTTACGTCGTTXjGGCAAACT 

CCAAGGATrTTCCTTCAAATT TGGA GGTTCAAGGTCCTtrrAGAATTTC 1 1 1 IAATGAGAT 

5 GTCCCATGATTTGCAGGTAAGC I ' 1'IUATTCCTTGG AAG AAAGCG AACG AGAAAAGGGCITGATGATTGCCCAGTr 

GTCGCATtjATATTAAGACTCCTATCACTTCGATCCAAGCGACGGTAGAAGGGATTTTGGATG 

GTCG^GCAAGCTCATTATCTAGCAACCATTGGACGCCAGACG^ 

TTTTTTGACCCTAAACACAGCTAGAAATCAGGTGGAAACrACCAGTAAAGACAGTAl^ 

ATTGAGTGCATGAGTGAATTTCAGTTTTTGATTGAGCAGGAGAGAAGAGATCrr 
10 CTGCCCGGATTGAGGGAGATTATGCTAAGCnTCTCGTATCTTGGTGAATCTGGTOT 

TGCTCCACK3AACCAAGCTGGAAGTGGTGGCTAACKntX}AG 

GCAGGGTATTGCCCCAGAGGATTTGGAAAATATTTTCAAACGCCTTTATCGTX^ 

ACAGGTGGTCATGGATTAGGACTTGCGATTGCGCGTGAATTGGCCCATCAATTGGGTGGGGAAA^^ 
^ AGCCAGTACGGTCTAGGAAGTACCTTTACCCTCCrrTCTCAACCTCTCTG 

4172.6 

ATGTTTGGTCA AACGGCT CAACATGGTCTTACGAATAGCCTGAAAGACTTCTGGA H ' lT 1 C TGCTGAATATAGGTC 
CACAATTGGCCjI 11111 IGCCAGATGCTCCGCTGTTCCAGATCGGTTGAGCAGGGTACTGQAAATCACCGTCGTGA 
GTTCAATATGATTCAGCAGATATTCTCGCATTTTGGGATGACTGVCTTGGGACAAATCA^ 
20 TCGATTG ACCTTGAGTTGCTGGTCAATGCACTf AATCATCAC 1' 1 Ul' 1CATTG ACAGACTGGTCCTCACCCCCAATC 

AAATAACGATAGAAATCGACAGGCAGATAGTACATtXTTCTTGACCTGCTGAAGGGG 

TCGACATAAAAAGTAT GTTC AGGCAGTTAGAACTGGCTAGCACGCAACAAATCTGTCCGATAAATCAGCGAGTC 
ATCATGGTATACTGGCCTTTGGAGAAATTTCCGACCTGGTCCCAGCC 

25 4174.1 

ATGG AACATTTAGCAACTTATTTTTCAACCTATGGAGG AG C 1 l ' l LT1 CGCTGCATTGCG AATTGTATTGGCGGTTG 

GATTAAGCGGTATGGGGT CTGC TTATGGAGTTGGTAAGGCTGGGCAATCTGCCGCAGCITTACT^ 

CTGAAAAGTTTGCCTG\GCIM'1UATATTGCAATTATTGCCCGGAACACAAGGA 

TTTAA IT IIjO 1 1 G CAATTAACTCCAGAACTTCCTTTAGAAAAAGGCGTTGCTTA IT 1 (J 1 11 U 1 AGCTCTTCCAATTG 
30 CTATTGTAGGATACTTTTCAGCTAAGCATCAAGGAAATGTAGCAGTAGCGGGAATGCAAA 

CAAAAGAATTCATGAAGGGAGCAATTTTAGCTGCCATCKTrAGAAACCTATGCA^ 1 1 1 GTCGTATCATT 

CATTTTG ACCCTTCGTGTATAA 

4175.2 

ATGTTAAAATCAGAAAAACAATCACGTTATCAAATGTTAAATGAAGAATTGTCCrrCCTATTGGAAGGC^ 
35 AATG IT n UGCTAATCTTTCCAA CGCC AGTGCTCTCATAAAATCACGTTTTCCTAATACCGTATTTG 

TTTGTTCGATGGAAAGGAATTGCnTrrAGGCCCCTTCCAAGGAGGTGTTTCCTGCA 
GGTO MTU 1X3GTGAGGCAGCTCACTTTCAGGAAACTGTTATTGTTGGAGATGTGACGACCT 
CirGTGATAGTCTAGCTAAAAGTGAAArrGTGGTGCCGATGATGAAGAAT^ 
GGATTCTTCAGAGATTGAGGATTACGATGCTATGGATCGAGATTATTTGGAACAATTTGTC 
40 AAGACAGCATGGGACTTTACGATGTTTGAGGAAAAATCTTAA 

4175.3 

ATGTCAGTATTAGAGATCAAAGATCTTCACGTTCAGATTGAAGGAAAAGAAATTTTAAAAGGGGT^ 
CTGAAAACAGGAGAAATTGCCGCTATCATGGGACCAAATGGTACAGGTAAATCGACTCTTTCTGCCGCTATCATG 
45 GGAAATCCAAACTATG AAGTAACTAAAGGTG AAG I t l'l G 1' l'l G ATGG CGTAAACATCCTTGAGTTGGAAG TG GAT 

GAGCGTGCGCGTATGGGA C r l ' l 1 CCTTGCTATGCAATACCCATCAGAAATCCCTGGAATTACCAATGCTG AGTTTC 
TTCGTGCCGCTATGAATGCGGGTAAAGAAGATGATGAGAAGATTTCAGTTCCTGACrrrrATTACT 
AAAAATGGAATTGCTCAACATCAAAGAAGAAATGGCAGAGCGTTACCTCA^ 

AAAACGCAATGAAA ritrn CAA Cl l'l'l GATGTTGGAGCCAACATTTGCTC 1 1 1 1 GG ACO AG ATTG ACTCAGGTCTT 
50 GATATTGACGCTCTTAAAGTTGTGTCTAAAGGTGTCAATGCCATGCGTGGTGAAG O 1 1 1 1 IKj 1 UCT ATGATCATCA 

CTCACTACCAACGT C l ' l ' l ' I G AACTATATCACACCTG ATCTGGTACACGTGATCATGG AAGGTCGTGTTGTCCTTTC 
TGGTGGTCCAGAATTtXjCTGCGCGTTTGGAACGTCAAGGATACGCAAAATTAGCrc 
CAAGGAAGAATTGTAA 

55 4174.4 

ATGCCCTACAAAAGACAAAGGA Ol 1 1 1 i CAATGGCACTTTCTAAACTAGATAGCCTTTATATGGCAGTGGTAGCAG 
ACCATTCGAAAAATCCACATCACCAAGGGAAGTTAGAAGATGCTXjAGCAAATCAGTCTCAACAATCCGACTT 
GGGATGTCATCAACCTCTCTGTCAAGTTrGATGCAGAGGA 
CACGATTTCAACTGCTTCTGCTAGTATGATGACAGATCCCGTT^ 
60 GCGACTA1 1111 1 C 1 UAAATGGTTG\AGGGCAAAAAG ATGAGCGTCAAGACCAACTTGGAGACGCGGCATTCTTG 

TCAGGTGTTGCCAAATTCCCTCAAAGAATCAAGTGTGCAACCCTAGCTTG 
AATCAAGAAAAACAGTAA 

4175.5 



SUBSTITUTE SHEET (RULE 28) 



WO 00/06737 



PCT/GB99/02451 



57 



ATGAAAATTCAAGACCTATTCAGAAAAGATCTCA 

GACGAGATGATTAAAAATTTGAC^ACCACGGTTATGTAACAGATTTTG 

CGTGAAGCTTTGACTTCTACTGGTTTGGGTGATGGAATCGCA^^ 

GCGACAGTTCTATTTGCTAAGTCAAATAAGGGTGTTGACTACGAGAGCTTGGATGGAC^ 
5 TCATG ATTGCA GCTCCAGAAGGTGCCAATGATACTCACrTGGC^GCCTTG 

AG ACGGTnTGCAG ACAAACITCGTCAAGCAAC ATCTG CAG ACCAAGTT ATCG AA C 1'1'ITI G ACCAAGCTTCAG AA 
AAAACTGAGGAACTTCTTTCAAGCACCTGCTAATGACTCrGGTGAaTTATCGTAGCT 
GTATTGCCCACACmCATGGCCCAAGAAGCCCHTCAAAAACT^ 
AAACCAACGCrTGCTAGCGGTGTTGGAAATCAACTAACTG 
1 0 CAGCAGACAAGGCCGTTGAAATGGATCGATTTGATGGAAAACCATTGATCAATCGTC 
GTAAGACAGAAGAGCTAATTAACTTGGCTCTTTCAGGAGATACTGAAGTCTAC 

CTGCAACAGCCTCTAACGAAAAACAAAGCCTTGGTGGTGC l 1 1 IXi t ACAAACACTTGATGAGTGGTGTATCTCAAA 
TGTTACCATTCGTTATCGGTGGTGGTATCATGATTGCCCTTGC C I IITIO ATTGACG C I GC I 1 1 U GGTGTTCCAAAT 
G AAAACCTIGGCAATCTTGGTTCTTACCATG AGTTAGCTTCTA I G l'l C^TGAAAATTGGTGGAGCTGCCTTTGGTTT 
1 5 GATCCTTCCAGTCTrrcCGGGTTATGTTGCCTACT 
GCTGCTATTGCCAAAGAAGGniTUCCTTTGCT 

CAGGTGTCrCATCTGGTTTCCTAGGTGCC C 1 1G l ' 1 GGTGG ATTTATCGCAGCTGC L T I trO " H CT ItiCCATCAAG AAA 
TACGTTAAAGTTCCrCGTTCACTCGAAGGTGCrAAATCAATCCTTCTATTGCCA C ' Mil 1 GGAACAATCTTGACAG 
GA 1TI G H ATGCTAGCTGTGAATATCCCAATGGCTGCAATCA^ 

20 AGG AGGTTCAG CTGTCC I TlTIXXTrATCGTCCTTGGTGG AATGATGGC TO 1 ' 1 G ACATGGGTGGACCAOTT AATAAA 

GCAGCTTATGTCITTGGTACAGGTACGCTTGCAGCAACT U I ITU l ' l CAGGTC G VI ClOT AGCCATGGCAGCAGTTA 
TGGCTGG AGG AATGGTGCCACCACTTGCAATCTTTGTCGCAA C IV 1 ' I ' Cl ' 1 ' 1 ' I CAAAGATAAATTTACTAAGGAAGA 
ACGTAACTCTGGTTTGACAAACATCATGVTGGGCTTGTCATTTATCACTGAGGGAGCGA 
GACCCAGCTCGTGCGATTCCAAGCTTCATCCTTGGTTCAGCAGTAGCAGGTC 

25 AACTCATGGCGCCACACGGAGGAATCTTCGTTATCGCCCTTACTTCAAATGCTCT 

GTAGGAGCAATCGTAAGTGOl Ol U/i 1 I ATGGTTACCTACGCAAACCACAAGCATAA 

4173.6 

ATGGCAAACAAGAATACAAGTACAACAAGACGGAGACCGTCTAAAGCAGAACTGGAAAGAAAAGAAGCGATTC^ 

30 ACGAATGTTGATTTCGTTAGGAATTGCGATTTTATTGA^^ 

CTTTATAATTTAATTCGCTTGCTAGTGGGTAGCCTAGCTTATCTXKjCGATATTCGGCCTA 

TTTCAACrrGGATACGAAAACAGGAAGGACTCTrATCTGGCITIT^ 

AGGCCTACTTGGTTTGGAAATATGGTTTGGACAAGTCCGTTCTAAAAGGGACXATGGCTC 

G ACTGGTTTTCG AACGACTAG C 1 T 1 0<J 1 O G AGGCGCCTTGATCOGGGTCGCTCTTTATATTCCAACAGC l ' 1 ' 1 i ' C 1 C 

35 TTTTCAAATATCGGAACTTACTTTATTGGTTCTATCTTGATT^ 

TACGATATTGCTGAATITTTCAGTAGAGGCTTTXjCCAAATGGTGGGAAGCGCACGA 

TTTGTCAAACAAGAAGAAAAAGCTCGCCAAAAGGCTGAGAAAGAGGCTAGATTAGAACAAGAAGAGACT 

AGCCTTACTCGATTTGCCTCCTGTTGATATGGAAACGGGTGAAATTCTGA 

CCTATTCCAGAAGAAAAGTGGGTGGAACCAGAAATCATCCTGCCrCAAGCTGAACTTAAATTCCCTG 
40 . G ATG ACTCAGATG ACG AAG ATGTTCAGGTCGA J ' IT 1 1 CAGCCAAAGAAGCCCTTGAATACAAACTTCCAAGCTTA 

CAACTCmGCACCAGATAAACCAAAAGATCACrrCTAAAGAGAAGAAAATTGT^ 

GAAGCAACCTTTGCTAGCTTTGGTATTAAGGTAACAGTTGAACGGG<XGAAAT^ 

GAAGTCAAGCCGGCTGTTGGTGTAAGGGTCAACCGCATTTCCAATCT 

CCAAAGATGTCCGGATTGAAGCACCAATCCCTGGGAAATCCCTAA 
45 CCACTGT ATCTTTCCG AG AACTATGGG AACAATCGCAAACG AAAGCAG AAAA l'l 1 1 U 1 1 GGAAATTCCTTT AGGGA 

AGGCKnTAATGGAACCGCAAGAGCrnTGACCmCTAAAATGCCCCACTO 

ACK)GAAGTCAGTACKL\GTTAACGGCATTATTGCTAGCATrCTCATGAA^ 

G ATGGTCG ATCCCAAGATGGTTG AGTTAT L ' lti ' n ' 1 ACAATGATATTCCCCAC L ' ll 'l' 1 U ATTCCAGTCGTGACCAAT 
CG^CGCAAAGCCAGCAAGGCTCTGCAAAAGGTTGTGGATGAAATGGAAAACCGTTATGAACT 
50 GGAGTTCGGAATATTGCAGGTTTTAATGCCAAGGTAGAAGAGTTCAATTCCCAGTCTGA 
CTACCATTCATTGTCGT O ATTGTGGATGAGTTGGCTCACCTCATGATGGTG 

TC A I ' CCLi IV n ti GGCAGAAGCCGCGTGCTGCAGGTATCCACATGA n c n ' O CAACTCAGCGTCCATCTGTTGATGT 
CA I C 1 C 1 O O I II (j ATTAAGGCCA A 1 0 IT CCA TCTCGTGTACCATTTOCGGTTTCATCAGGAACAGACrCCCGTACG 
A l 1 nO OATOAAAATGGAGCAOAAAAA CnCl I G GTCG AGG AGAC^TGCTCTTTAAACCGATTGATGAAAATCAT 
55 CCAGTTCGTCTCCAAGGCTCCTTTATCTCGGATGACGATGTTGAGCGCATTGTGAAC 

ATGCAGACTACGATG AGA O 1 1 1 1 GATCCAGGTG AGGTTTCTG AAAATGAAGGAG A A 1 1 11 IG GATGG AGATGCTC 
GTGGTGATCCGC1 111 1 0 AAGAAGCT AAGTC 1 1 IGGl ' l 'ATCGAAACACAGAAAGCCAGTGCGTCTATGATTCAGCG 
TCO 1 r I ATCAGTTGGATTTAACCGTGCGACCCGTCTCATGGAAGAACTGGAGATAGCAGGTOT 
GAACGTACCAAACCTCGAAAAGTGTTACAACAATAA 

60 

4176.1 

AT GACTTA TTTTAAAAAATA TAAAT TC GATAAAT CCCAGTTCAAACT 

CTOri'1'1 ICTAGTTCTCTTGArri riGGC'l 1111 1 GGCTGG AAAGGTCTTCAAATTGGTGCTn'GACAGCCU 1 1 1 1 iA 
GCCTGAGGG AGA G 1 1 1 ' 1 G ATG AGAGTGTTC ATTTTGGG ACTTCGCGTATTCTAGG AAATAGT ATCGGTX/GACT 
65 TGCCTTGGT C 1 1 L ' I ' l ' lTl ATTAAATAC l ' lT IT 1 CCACG AAGCCTTTTGGGTGACCTTGGTAGTTGTTCCAATCTGCA 
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(XAXGTTAACCATTATGACAAATCTAGC 
CATTACCCTATCAATC^ 

TTTGTCGCAATTATCGTAAATTACGATATTGATCGTATTCGTCTCTTTTTA 
4178.2 

ATCAATAAATCAGAACACCGCCACCAACTTATACGCGCTCTTATCACAAAAAACA^ 
TO^GCCCTTXnTCCTGAGAACGACATTCAACTAACCCAGGCAACCCTCT 

T E2 AA ^TCT ACATGGAAGACGCCCTTCTCTGGATGCGCCCAGTTCAACACCAAGTCCTACTAA^ 

TCTOTOTATCATCTGTGA^GATGCAGATACn^ 
TTTCTTTGAAGAATAA 



4179.1 

I J ATGAAAAGTATAAAATTAAATGCrCTATCTTACATGGGMTTCGTGTCTTGAATATTATT^ 
AACCTATGTCGCGCGTGTCTTGGACCGAACTGACTATGGTTAC^ 
TGCCCTTTGCAACTTATGCTGTCTATAACTACGGTTTAAGGGCTATCAGTAATCT 

taacagaac h 1 1 1 ctagtl 1 1 ii 1 1 atttgtgcatcgcttgtacgattttgaccactgctgtctatatcctagcct 
atcctctcttctttactgataatccaatcgtcaaaaaggtct 

ttttcaatcgaatg ggtc aatgaagctcrggaaaattaca g i i muc1 u l 'acaaaactgccttcatccgtatcct 
gatgctggtctctattttcttatttgttaaaaatgaacacga 
cgctgattaactacctgattagttatttttggattaaaagagacatcaaacttg^ 
taaaccactctttctccctctgacagccatgttagtctttgccaatgccaatatc 
tcttcctcgttaaaacagggattgatgtcaacgttagttactataccatagctcagcga^ 
£3 tcgggttctaacaggtgcaattggagtgagtcrr^ 

ctatgtttctctggttaatagaggtagtcgaatctttaacttctttatcattcdv 
taggaccaaatgccatcctacmacggtagtgaaaaatatatcg^ 
tcgtacgattatcctggccttagataccattcntggttcccaaattctctttacg 
acagtctatacagtctttgcrgggctactcaatttgggcttgaatagtct cl vv vv ltl ' c aaccatatcgtggctcc 

TGAATACTACTTACTGACAACTATGCTATCAGAGACTTCTCTACTTXnTTTCTATAT^ 

AACTCATCCACTTGGGACATATCTTTAGCTATACTGTTCGATACTL 1 C 1 L T IT I CA CITTCCTTTGTAGCAATTTATT 
TCCTGATTAApTCCrGTATCCT 

ATA cluui 1 i uATTGTCTTGCTATCAGCTATCTCTTATATTAGTCTA C 11 U 1 C 1 ' I 'CACAAAAGATAGCATTTTCTAT 
^ 1 GAAl 1 1 1 ' I AAACCATGTCCTAGCCTTAAAAAATAAATTTAAAAAATCATAG 

4179.2 

ATGAAACAACTAACCGTTGAAGATGCCAAACAAATTGAAT^ 
AGCACA>TATCAACTATATTATTAACTACGGTACTCTGATTGGGGCGGT^ 
CGACGATATTGATCTCrrCCATGCCTAGAGAAGACTACCAACGATTTATTAACATTrrr^^ 
W GTATAAGCTCCTATCCTTAGAAACTGATAAGAACTACTTTAACAACTT^ 

ATTATTGATACTCGAAATACAAAAACCTATGAGTCTGGTATCTTTATCGATATTTTCCCTATAGATCGCT^ 
TCCTAACKrrCATTCATACrTGTTATAAACTGGAA^ 

TATAAGGATAGCCTTTTAAAAGATTGGATACGAACAGCCTTCTGGTTACTCCTTCGACCGGT^ 
TGCAAATAAAATCGAGAAAGAAATTCAAAAATATAGTCGTGAAAATGGGCAATATATGG 
^ ATTTAAGGAAAAGG AAGTC TTCCCAAGTGGTACCTTTGATAAAACAATCGATTT 

CCIX3CACCTGAAAAATTTGATACTATTTTGACACAATTTTATGGAGATTATATGACCCT 
GCTTCTACAGTCATGAATTTCACGCTTATAAATTGGAGGATTAG 

4179.3 

5D ATGATAAAAATCAATCATCTAACCATCACACAAAACAAAGATTTACGAGA 

ATCCAAG ACGGGGAAAAGOTT GCTATTATTGGTGAAGAAGGAAATGGCAAATCAA 

GGGOAAG^i i m i wi uATTT^CTATCAAGGOAAACATCCAATCTGACTATCAGTGVCrGGCCTACATTCCTCAAA 
AAOTC CCTaAGG ACCTAAAAAAOAAAACnTACACGACTA Cl 1U1 1 1 1 A GATTCTATTGATTTAGACTACAGTAT 
CCTCTATCGTTTGGCGGAGGAATTGCATTTTGATAGCAAT^ 
JJ GCGGGCGAACXnTrcAAAATTCAGCTTATCXATGAGTrA^ 
CAAATCAOTAGACCTTGAGA^ 

1 1 1 uuCATGATGAAGAC 1 1 1 1 CT l'1'CTGAAACGGCAGACACTATTGTTCACTTGCGACTGGTCAAACACCGTAAAGA 
AGCGGAAAOTCTAGTAGAGCATTTAGACTATGATAGCTATAGTGAGCAGAGAAAG 
TCAGCAAGCTGCTAACAACCAAAGAGCCTACXjATAAAACCATGGAAAAACAT^ 
OU AA^CTGCGCTTCGAGCTACCAAAGATAGTACnjCCGGTCGCCTATTGGCTAAAAAG 

AAGAAAAACGCTACGAAAAGGCAGCTCAGTCCATGACTCAAAAGCCACTTGAAGAGGAACAAATCC^ 
T TTCAGA CATCCAACCATTACCAGCrrCTAAAGTCTAGTCCAACTGGAAAAAGAA^ 
AGTTTTGGTrCAAAAACTACAACTAACTGTCCGTGGCCAAGAA^ 
GC CAAAT GGTCnTGGGAAATCAACTaxrrTAGCCAAGmCAGAGACrrCT 

03 tggttttatgccacaagattaccacaaaaaactgcaattggatttatcccg\atagcctatctcact 
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^^i^^?^^ AGO AmACXACAG A«Aj\ATCC AAXCXCACCX AG CT ACjTCXC A^TTTCACTT ATCCAG AAATGCAG CATC AA 
ATTCGCrcCTTATCTGCCCCACAACAGGGAAAACTCCTCCTITlXjGATTTACT 
TGCTGGATGAACCCA^CGAAACTTTTCTCCCACTTCTCAACCCCAAATCAGAAAACT 
CGGTCTCATCACTGT^ 
5 GGTTTGAAGCTAGTTAATTTAGAAGATTTATAA 

4179.4 

ATGAAACCAAAAACATTTTACAACTTGCTTCCCGAGCAGAATCTTC 
GAACGTTATTTTGAGCTCTTGGTCGAGTGGAATGAGAAGA 
10 TATCTCAAACATTTTTACGATTCGATTGCACCCATTCTTC^ 

TATCGGGGCTGGGGCAGGATTTCCTAGTCTACCAATGAAAATTCTCTATCCGGAGTTAGATCTC 
TCACTCAATAAGCGCATCAACTTCCTAG\ACT 

gacgtgccgaagattttgcccaagacaagaacttccgtgctcaatatga rrnti iaacagctcgtgcggttgcccg 
tatgcaggtcctatctgaattgactattccctaccttaaggttggtggcaaactattagcact 
1 5 gcgcctgaggaattattagaagctaagaatgccctcaatctcctt^ 

ccctaccgaatagagatccgcgctatatc^cagtggtagaaaagaaaaaagaaacaccaaataaatatc^ 
aggctcgtatgccaaataaacgcccactttaa 

4179.6 

20 atgagtattaaactaattgccgttgatatcgacggaacccttgtcaacagccaaaaggaaatcact 
tttctgccatccaagatgccaaagaagctggtgtcaaagtcgtgattgcaactggccgcc 
caaacttctacaccacttgcagttgagagacgagggggactatctggtaacc^ 10 1C CAAGA 

aactgctacaggacatgagattatcagcgaatccttgacttatgaggattatctagatatggaattc 
aagctcggtgtccacatgcatgccattaccaaggacggtatctatactgc^ 

25 gtacacgaatcaaccctcgtcagcatgcctatcttct^ 
aatgtatgtttatcgatgaacxagaaatrctcga 
ctccatcaacaaatctgctccnttctacctcgaactcctraaaaag 
ttggctgaaaaactcggattgaccaaagatgaaaccatggcaatcggtgatgaagaaaatgaccff 
gaagtcgttggaaaccccgttxrrcatggaaaatggaaatccagaaatcaaaaa^ 

30 acaaatgacgaatccggcgttgcccatgccatccgaacatgggtactgtaa 

4179.7 

ATGACTTGGATTATTCTTtXjAGTTATCGCTCITATTGTTA IIMM ' IXJI GA rTGIT AGCTATAACG O ' l 1 ' lGGl IAAAAA 
TCGTATGCAAACCAAGGAGGCTTGGAGTCAGATTGATGTTCAGT 

35 ATTGAGACTGTAAAAGGTTATCCCAAATATGAAGCrrTCTACCCTroAAAAGCT 

GCCCCAGCG ACTTCACCAGCAG AAGCT ATG AAAGCCAGTC AT^XT CT ITITI OCAG 

TTGCAG AAAGCTATCCAGATTTG AAAGCTAGTGCTAAL I ' 1 1 G I I AAATTGCAAGAGGAGTTGACAAACACAGAAA 
ATAAAATTTCTTACTCTCGTCAACTCT ATAACAG TO T 1 GTCAGCAACTACAATGTAAAATTAGAAACTTTCCCGAG 
CAATATTATCGCTGGAA 1 0 111 Ci GATTTAAAGCGGCAGATTTCCTTCAAACACCTGAAGAGGAAAAGTCGGTTCCr 

40 AAAGTTGATTTTAGCGGTTTAGGTCACTAA 

4179.8 

a tg1 i g l'l 1gatcaaattgcaagcaataaacgaaaaacctgga i 1 i ' lul ' igclo cta ntntx tactcttagctct 
tgttgg ttat gcggttg gttat ctctttataagatctggacttggtg ij 1 1 1 og 1 1 attgcactgattatcggcttta 
45 tctacgctttgtctatgatttttcaatcgacagagattgtcatgtccatgaa 

AACGGCACCAGACCTCTACCATGTAGTGGAAG ATATGGCTCTGGTCGCI^GATTCCTATGCCC CXj i ' 0 1 ' I I 1 CATC 

attgatgatccagccttaaatgcctitgcgacaggttctaatcctc^ 

tactagctatcatgaatogtgaagaactagaagctgttatgggacatgaagtcagtcatat^ 

ccgtatttcgactattgcagttgcccttgctagtgctatcaccatgctttct 

50 cttggtgg agcaggtcgcag acoaagtg atg atg accg ag atgg aaatc ixigitl 
ccctacragctatrotactggcacctctcgctgcaac l ' 1 1 go i ' l ' c agctcgct a 1 ' i ' l t-'i c gtcagaggga a 1 '1'i c i g 
gcagatgcatctagtgtcgagctgactcgcaatccccagggaatgattaatgccct 
aaacctat gagtc gccaogtcgatgatgctagcagtgccctttatatcaatgatccta^ 

^ caaaaactcrrttatacccactxacctatctcagaaccw 

4179.9 

ATOAAATTAAATATTCAAGAAATTCCrrAAGCAGTCTGAAGGTTTC 
ACCTGCGTCCACGTAATCAAGAAATTTTAGATGTAAAAGATATCC^ 

GTATGTATTTGTTAGATTATCAACT ATCTTAT ACCATT G T TCI T GCTTCGAGTCGCAGTATGG AGCCAGTTG AGTTA 
60 GTTGAATCTTATCCAGTCACGGAAO 11 11 CATGGAAGGCGCAACTAACCAGCTAGATCAAGAAO 1 II 1 AGATG ATG 

ACTTGGTCTTGCCCATCGAAAATGGGGAGCn^ACCTTGCTGAGAGTGTATCA 
TATCAAGGTCTTGACGGCTGAAGAAGAAGCTGGTCAAGGATTTATCTCAGGAAATGACTGGCAAAT 
GGAAGAATACCAAGCTCAAAAAGCAGTAAAGAAAGAAG AAAACAO ICU I 1 1 OC 1 OGCTTACAAGG ACTATTTG A 
CGGAGATGAATAA 

65 
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4179.12 

ATGGAGTTATTTATGAAAATCACAAACTATGAAATCTATAAGTTAAAAAAATCAGG^ 
TGAAAGTGCTAGAATACCCTGAAAATGTTGATCAGCAGCTTTTGTTGGCTGATATTCCAGATATCrCAG 
TAATCCAGCC GU \ I I ATGGAACGTTATTTTCAGATAGACGATGCGCATTTGTCGAAAGAGTTTCAAAAATTTCG\ 
5 TCmCTCTATTTTAGATGACTGTTATCC^ I ATTTTACAA 

GGGAAATCTTTjACCTCCTGAAATTCCCGAAGGTAGCGGT 
GTCAGTTOAAAAAGTCATTCAAGGCTTGGAAAATGAACT^^ 

AGCAGCTCATATGGCAGCTCTTCAG AATGGCGG AAAAACCATTGCAGTG ATTGGAACAGG ACTC IT !TA 

TCCTAAAGCCAATAAACGCTTGCAAGACTACATCGGCAATGACCATCTGCTTTCT 
1 0 ACAACCTCTGAAATTTCATTTTCCTGCCCGTAATCGCATCATTGCTGGA C r 1 ' l ' Xi ' 1 C GTGGTGTG ATTGTAGCAGAG 

gctaagatgcgttcaggtagtctcattacgtgtgagcgagcaatggaag 

gtagcattttagatggactatcagacggttgccatcatttgattcaagaag 

aagatgttcttgcggaatttgaattttaa 

15 4181.1 

atgaaacgtcaattagccttggtcgtctttagtggtggtcaagattc^ 
actatgaaacagtcgaagctgtcacctttgcctacggccaacgtcatcacc^ 
cgctaaggaacagggcatogtcaccatatcctcgatatgtctctgoxkigacaaatcac^ 
gcgacgattcatatttcctacattcctgacaagctctgtgtcgagtcaaaatc 
20 accgaaaccacggagattrccacgaaaactgtatcaacaccatcgggaaagacttggtcaacttg 

gctatttagaagtctggggaaaattcacrccgcgcggtggcatttcaatcgacccctactacaactacggtaag 
aggaactaagtatgagggcttggcagaacaacgcctcttc 

CTAA 
25 4181.2 

ATGACCGAAACGGTAGAAGATAAAGTAAGTCATTCAATTACTGGGCTTGATATCCTCAAGGGGATAGTTGCTGCG 
GGAGCTGTCATAAGTGGAACCGTTGCAACTCAAACGAAGGTATTTACAAA 

GTAGAGAAAACGGATGCrrTGGCAACAAATGATACAGTAGTTCTAGGTACGATATCTACAAGTAATTCAGCGAGT 
TCAACTA O VVIXt IO GCTTCAG AGTCGGCAAGTACATCTGCATCTGAGTCAGCCTCAACCAGCGCTTCGACCTCAG 
30 CAAGTACAAGTGCATCAGAATCAGCAAGTACATCGGCrrCGACAAGTA 1 1 1 L 1 U CATCATCTACTGTGGTAGGTTC 

ACAAACAGCTXjCCGCTACAGAAGCAAGTGCTAAGAAGGTCGAAGAAGATCGTAAGAAACGVGCTAGTGATTATC 
AGCATCAGTTACAAATGTCAATCTCCAATCTTATGCTAAGCGACGCAAGCGTTCAG^ 

CTGGCTTCTATAAAAAATGCTGCTGl H IT r C IXjGCAAT ACG ATTGT AAATG GCGCCCCTGCAATT AATGC AAGTC 

taaacattgctaaaagtgagacaaaagtttatacaggtgaaggtgtagattcggtatatcxtg 
35 taaattgaaagtgacaaatgatggttcaaaattgacctttacctatacggttacctatot 

gatcttggtaatatatcaagtatgcgtcctggatattctatctataattcaggtacttcaacacaaacaatgtt^ 
cccttggcagtgatctrggtaaacctrcaggtgtaaagaactacatt 

ctataatacatctacaatg acg acgcagggtagtgggtatacttgggg aaatggtgcccaaatgaatgg 1 " i iu 11 
gctaagaaaggatatggattaacatcatcttggactgtaccaattactggaacgga 

40 tacatcctttacatttaccccttacgctgctagaacagatagaattggaattaact 

gtagttgaatctagcacgaccagtcagtcactttcacagtctaagtcactctcagtaagtg 
cagcttcagcatcaacaagtgcgtcggcttcagcatg\accagtgcctcggcttcagcgtcaaccac 
cttcagcaagtaccagtgcttcagtctcagcatcaacaagtgcttcagcct 
agcaagcacatcagcatcrcaatcagcgtcaaccagtgcttcgccttcagcaagtac^ 

45 atcaaccagcgcctcggcctcagcaagcacctcagcttctgaatcggcctcaaccagcgcctcggcctcagcaag 

caotcagcttctgaatcggcctcaaccagcgcctcagcctcagcatc 
aagcgcctcgggttcagcatcaacgagtacgtgagcttcagcgtcaaccagt^ 
tgcgtcagcctcagcaagtatctcagccrrctgaatcggcatcaacgagtc 
gtcagcctcagcaagcacctcagcttctgaatcggcctcaaccagtgcgtcagcct 

50 agcttcagcaagtaccagtccitcagcctcagcgtcgacaagtgcgtcggcctc 
atcaaccacrrgcgtcagcctcagcaactactagtgcatcggctrcag^ 
accagtgcgtcagcrrcagcaagtaccagtgcttcagtctcagcatcaacaagtgcttca 
agtccctoggcttcagcaagcacatcagcatctgaatcagcgtcgacaagcgcctcagct^ 
gcgtcagcttcagcatcaaccagcgcctcggcctcagcaagcacctc^ 

55 tcggcctcagcaagcacctcagcttctgaatcggcctcaaccagcgcctc^ 
gcrrcagcaagcacaagcgcctcgggttcvgcatcaacgagtacgtcag^ 
tcagcatcaacaagtgcgtcagcctcagcaagtatctcagcgtctgaatcggcatcaacgagtgcct 
gcatcaacgagtacgtcagcctcagg\agcaccrcagcttctgaatcggcctg\accagtgcgtcagcctcagca 

TCGACAAGCGCCrCAGCnXIAGCAAGTACCAGTGCrrCAGCCTCAGCrCGA 
60 C^TCTGAATCGGCATCAACCAGTGCGTCAGCCTCAGCAAGTACTAGTGCAT^ 
CCX}CTrCAGCATCAACCAGTGCCTCGGCTTCAGCGTCAACC^GTC 

CTCAGCATCAACAA G 1U C1 1 CAGCCTCAGCATCGACAAGTGCcTCGGCTTCAGCAAGCACATCAGCATCTGAATCA 
GCGTCGACAAGCGCcTCAGCTTCAGCAAGTACCAGTGCGTCAGCCTCAGCGTCGACAAGTGCGTCAGCCTCAGCA 
AGTACTAGTGCATCAGCTTCAGCATCAACGAGTGCATCGGCTTCGGCGTG\ACCAGTGCATGVGAGTCAGCAAGT 
65 ACCAGTGCGTCAGCTTCCGCATCAACAAGTGCCTCGGCTTCAGCAAGCACCAGTGCGTCGGCTTCAGCAAGTACT 



SUBSTITUTE SHEET (RULE 26) 



WO 00/06737 



61 



PCT/GB99/02451 



AGCGCCTG\GCCTG\GCCTCAACCAGTCCGTCAGCCrCACCAAGTATCTCAGCCTCTGAATCCGCATCAACGACT 
GCGTCCGCTrCAGCAAGTACrAGCGCCTCAGCCTCAGCCn-CAACAAGTGCATCG 
TCTGAATCGGCATCAACGAGTGCCrrCCGCTTCAGC\AGTACTAGCGCCTCAGCCTCAGC 
GCTTCAGCATCAACGAGTGCGTCCGCTTCAGCAAGTACTAGCGCCTCAGCCTCAGCGTCAACAAGTC 
5 CAGCGTCAACGAGTGCGTCTGAGTCAGG\TCAACGAGTGCGTCAGCCrCAGCAAGCACATCAGCTTCTGAATCTG 
CATCAACCAGTGCGTCAGCCTCAGCATCGACAAGCGCCTCAGCrTC 

CGACAAGTGCGTCGGCTTCAGCAAGTACCAGTGCGTCAGCCTCAGCAAGTACCAGTGCGTCAGCCT 
CAAGTGCGTCGGCCTCAACCAGTGCATCTGAATCGGCATCAACG^ 
CAGCTTCAGCATCAACGAGTGCATCGGCTTCAGCATCAACCAGTGCATCAGAGTCAG 
1 0 TTCCGCATCAACAAGTGCCTCGGCTTCAGCAAGTACTAG 

4183.1 

ATGGGGGTCGAAACTTGGTTTTATTCTAGCATCTGCIXKjCTGGCCATCGGGCTTGGTTC 

ACATG ACTCCTGCTA ATGGCGGTGGAGGC n I I I ACTAATCTTTCTCATTTCCACTATTTTAATCGGTTTCCCTCTC 
1 5 CTGCTGGCTGAGTTTXjCCCTTGGCCGTAGTGCTGGCGTTTCCGCT 
ACAAGTACAACTTTATCGGTTGGATTGGCGCCTTTGCCCTCT^ 

TGG ATTCTAGTCTATCTAGGTATTGAGTTTGGGAAATTGTTCCAAC r 1 GO lOGAACGGGTGATTATGCTCAGTTAT 
TTACTTCAATCATTTCAAATCCAGCCATTGCCCTAGGAGCTCAAGCGGCCn^ 

TCACGTGGGGTTCAAAAAGGGATTGAAAGAGCTTCGAAAGTCATGATGCCCCTGCTCTTTATCGTL'l 1 101 1 1 HA 
20 TCATCGGTCGCTCTCTCAGTTTGCCAAATGCCATGGAAGGG O 11 Crrr ACTTCCTCAAACCAGA Cn'n CAAAACT 

G ACTACCACTGCTCTCCTCT ATGCTCTGGG ACAAT C l ' l IL ' l l ' I GCCCTCTCACTAGGGGTTACAGTCATGTTGACCT 
ATGCriCriACTTAGACAAGAAAACCAATCTAGTCCAGTCAGGAATCTCCATCGTAGCCATGAATATCTCGATAT^ 
CATCATGGCAGGTCTAGCCATTTTCCAAGCTCGATCCCCCrrCAATATCCAGTCTG 

TTTATCGTCTTGCCTCAACTCTTTG ACAAG ATGCL" 1 1 1 1 GG AACCATTTTCTACGTCCTCTTCCTCTTGCTCTTCCTT 
25 TTTGCGACAGTCA C l ' l ' 1 ' l 'l CTGTCGTG ATGCT GG AAAT CAATGTAG ACAATATCACCAACCAGG ATAACAGCAAAC 

GTG CCAAATGG AG TGI T ATTTT AGGAATTTTG ACCTTTGTC 1 ' l'l GGCATTCCTTCAGCCCTATCTTACGGTGTCATG 
GCGGATGTTCACA I V V l ' l GGTAAGAC LTllT V 1 GACGCTATGGA C 1 1C l ' l OO I ' ll C CAATCTCCTCATGCCATTTGG 
AGCTCTCTACCTTTCA IT1 1 1 1 ACAGGCTATATCTn'AAAAAGGCTCTrGGAATGG AGGAACTCCATCTCGATGAA 
AGAGCATGGAAACAAGGACTGTTCCAAGTCTGGCTCTTC C ri ' CI ' I 'C U 1 ' l ' l en CGTTTCCTCATTCCAATCATCATC 
30 ATTGTGGTCTTCATTGCCCAATTTATGTAATCAAAAAGGACTTGAGTAG 

4183.5 

ATGTTGAAAAAATGGCAGTTAAAAGATGTTATCTTGCrTGC'rrrClUUrCrATL'l 1111 IGGTGGGGl'l 1 ICOl 1GG 
TTCAGG ATATGTGTATAATATTCTCAGTCTACTCTTAACACCT C ' 1 ' 1 GO I ' l ' l O CAGGCCTTTCCCAATG AAATCCTCT 

35 TCGGTCTCTGGTGTATGGCTGCGCCCATTGCrGCCAT C IT 1 GT I 'CCGAGAGTCGG AAGTGCAACGATTGGAG AAGT 

GCTAGCTGCGCTTGCTGAA UIL ' L ' I I l ATGGTAGCCAATTTGGTCTAGGAGCT Cn ' l ' l GTCTGG C 1 ' 1 101 1 CAAGCTT 
TGGGAAGTGAATTTGGTTTTATCGTAACTAAGAATCGCTATGAAAGTTGGCTCT 

G ATTACG C 1 1 0 1 ' I ' AGCTTTGTCTATGAATACATTAAGTTAGGTTACTACGC C ' l ' l 1 ' 1 CCCTTCCGTTTGTCCTTTCCTT 
GCnVl O GTAC Oni 1 ATTTCTGTTTA IIICI" I CTGT ACCATCTTGGTTCGTGCC ATTGTCAAACTCTATCATC AGTT 
40 TGCAACTGGAGGAAAAGCATAG 

4183.6 

ATGGTCAAAGTAGCAACCCAGACACCGATTATCAGTCTCTTCTTGCTGATTTTATCCTTGGAAAC^ 
TTCGATTGCTCTG ACTCTTTCGGTAGTCGCA 1 ' l ' l lO 1 ATTCTCTTTATGCTCTATTACCGTCG ATTTAAAATGTTAG 

45 CTTGGATG ATCATACTTGCCATTTTACCAT C 1 ' IT 1G CCAACTACTGGGCAGTTCAGTTACACGG AGATGCITCACA 

GGCAGTCATGCTTGG AACGAGCGCC l'l' 1 0" 1 OACAGTTTGTATCGGCCTTGTC 1 1 101 1 1 CCTCTGTTTCACT AAAAG 

AGCTTCTCTTGTACTTGGCTCAAAAGGGGCTATCACGCTCTTGOT 

CCTCTCATTCAGCAAGAAATCAAGTCCCTCAAAGAAGCTTGCCTATTACGTGGT 

CCTTGATTTACAGTAAGGTTCTGATGACAGTCTTTAGGTGGCGCCA 
50- ATATGACGAACATGCACAGTTGAAGAATAGCTATCGGACTTTTTATATTCCTAAAAAA^ 

CTTTTCTTTTTATTGCTTCAAACCAGTCTATTTTTATAA 

4183-7 

ATGAGAAAGCACCAATTACAAGTTCACAAATTAACCATTTTATCTATGATGATTGCCCTTGA^ 
55 CTATCTTTCGAATTGAGGGAATGGCACCGATCTCCAGTGTAGTCAATATTCTA 
TTATGCCTrcGCTATGGCTACAOTCACAGCCTTTATCCCrrATGACGACT 
GGAGCGACTTTTGGAGCCCTTCTAGCAGGTCTCTTTTATAAGTACGGTCGAAA^ 
AGATTTTGGGAACAGGTATTATTGGTTCGATTGTTTCCTATCCTGTTATGGT 

C TTAG O 1 00 11 1 lA TCTACACGCCTCOATTTrTCGGAGCAACCTTGATTGGTACAGCGATTTCCTTTATTC 
60 A I ' l ' l 1 1 ' 1 'AATCAAGCAGOAATTCTTTAAAAAAGTGCAGGG ATA 1 1 1 CI ' 1 ' I 'AGTG AAAGG ATAGACTGA 

4183.8 

ATGCAGGAATTTACAAATCCCTTTCCTATAGGCTCTAGTTCCCTCATTCACTGC^ 

G ATGCTGGCAAATGGG ATTTTGGCTCTGGG ATGCAAACCTGTCATGGCAG ATGATTCCCGTG AAO l'l C I rGATTTT 
65 ACTAAGCAAAGTCAGGCTCTCTTCATCAATTTGGGGCATTTGTCAGCTGAGAAGGAAAAAGCAATCCGCATC 



SUBSTITUTE SHEET (RULE 26) 



WO 00/06737 



62 



PCT/GB99/02451 



GCrrCCTATCCAAA CCAAT CTTCT^ 

GCTTAGTTAAAGACCTTTTAGACTATAGACCTACGGTCCTTAAAGGAAACATGTCAGAAATTCGAA 
ATTAAAGCACCACGGCGTTGGGGTCGATGCGAGTGCTAAAGATCAAGAAACGGAGG 

AG ACTGGTGTCAG ACCTATCCTGGT A 1 G 1 CW'IU'CIU'AGTCACAGGTCCG^AGGACCTCCTCGTTTCGAAAAATCAG 

CTC CCTGTA CTCGGAAATGGCTGTACTGAATTAGACTGGATAACAGCGACAGGAGACT^ 

GCTum 1 1 iCTCAGCCAAGGAAAGACTGGTTTTGAAGCTTCTTGCnTAGCAGTCTCTTATCT 

GAAAATAGTTGTTCAAGGAATGGGATTGGAAGAATTrCGTTACCAAGTACTCA^ 

AGATGAAAATTGCCTAGATACCATCAAAGGAGAGGTTTATGAATAG 



4185.3 

ATGAACCATAAAATCGCAATTTTATC\GATGTTCATGGCAATGCGA 

AAAAATCAAGGGGCCAGTGAATATTGGCTTCrGGGAGATA I ' l l I I C n 'CCTGCTCCAGGCGCAAATG ACTTACTCG 
CCCTGCTAAAGGACCTTCCTATCACAGCAAGTGTTCGACX3CAATrGGGATGATCCrrGTC 

GCAATATGGCTTAGAAGACCCACAGGAAGTTCAGCTCrTGCGTATGACACAGTATTTGATGGAGCGAATGGATCC 

TGCAACGATTGTCTGCKTACGAAGCTrcCCTTTGCTGGAAAAGAAAGAAATTGACGGATO 

CATAATTTACCTGACAAAAACTATGGTGGTGACTTGCr 

GATGCGGAAACGGACGTGGCAGTTTATGGTCATGTTCACAAGCAGTTGC 

ATCATCAATCCAGGGTCGATTGGCATGCCCTATTTTAATTGGGAGGCGTTAAAAAATCACCGTTC 

TGATAGAAGTTGAAGATGGGGAATTACTCAATATCCAATTTCGT AAAGTTGCTTATGATTACGAAGCTGAGTTAGA 

ATTGGCCAAGTCCAAGGGGCTTCCCTTTATCGAAATGTATGAAGAACTGCGTCGTGACGATAACTATCAGGGGCA 

CAATCTGGAATTATTAGCCAGCrTAATAGAAAAGCATGGGTATGTAGAGGATGTGAAGAATTrrm 

TAA 

4186.1 

ATGAATGTAAATCAGATTGTACGGATTATTCCTACTTTAAAAGCTAATAATAGAAAATTAAATGAAACATT^ 
TTGAAACCCTTGGAmATGAAGGCCTTGTTAGAAGAATCGGCCTTTCTCT 

GCrGGTTTTAGAAGAAGCTCCCAGTATGCGTACTCGTAAGGTAGAGGGAAGAAAAAAACTAGCTAGATTG 

CAAGGTGGAAAATCCCTTAGAAATTGAAGGAATCrrATCTAAAACAGATTCGATTC 

AAmATGGCTACGCTTTTGAAATTTTCTCACCAGAAGATGATTTC 

CTAGTAGAAGTACWAGAAAAGCCTGAATTrCAAACAGATTTGGCATCA^ 

TGGAATTACATCTCCCAACTGATATCGAAACrmCTTGGAATCATCTGAAATTGGGGCA 

AGCTCAGGGGCACGATTTGACTGTGGACAATACGGTTACCTGGGAOT 

TTAGACATAGCAAGTCTrCGCCAGAAGTTTGAGTCTACTGAATATTTTATTCCTAAGTCTGA 

GTAAAGATACAAATAATGTTGAATTGTGGTTTGAAGAAGTATGA 

4186.2 

at gaag tggaccaagattattaaaaaaatagaagaacaaatcgaggcagggatrfatcccggagcctcttttgcg 
tattttaaggacaatcaatggacagagttctatttaggccagagtgacccag 
ggactagtttatgacctagctagtutcagcaaggttg 
gtcaattacatattgataoactggtaataoattttttacctgagagtgatta 

CTTGACTCATGCAj\CAGACCTTGATCCTTTTATTCCT 

ATGTTTCATCTCAACAGACG AAGTCAGCCAGCC 1 1 I U ' 1 1 ATTCGGATGTCCA 1 Mil IbClOl i GGGCTTTATTTT 

GGAAAGAATTTTTAATCAAGATTTGGATGTGATTTTAAAGGATCAAGTCT 

TAAGTTTGGGCCAGTTGAGCTTGCTGTTCCAACAGTTAGAGGTGTAGAGGCA 

TCGTCTCCTGGGT A G ACAT GCTGGG AG TGC'lUG 1UTATTTTCGACTATAAAGGATTTACAAATCTTTTTAGAACAC 

TATTTAGCAGATGATTTTGCAAGAGACTTAAATCAAAATTTTTCTCCTTTGGATGACAA 

GGAATrnW AAGG AGATTGGCTAGACCATACGGGCTATACAGGTACCTrTA 

AAGCG^CTATTTTCCTATCGAATCGTACCTATGAAAAGGACGAGAGAGCTCAATGGATATTAGACCGCAATCAAG 
TGATGAACTFGATTCGCAAAGAAGAGTAA 

4187.2 

ATGATGAAGAAG ACTTATAATCATATTTTGGTCTGGGG AGTCATTTrCTATAGCATTTGCA 1 ' ll i ' 1 ' C 1 G I ' 1 1 1 1G CTT 
TACr CCTC AAGAACAATCTACCGTGGGAGTGGG AACTCCAGGTATTCAGCATCTTGGACGCCTG G l ' lTl ' IC IT i I G 
ACTCCTTTCAATTCTC TCTGGAA ACTGGGCGAAGTGAGTGACATTGGACAATTATGTTGGAT^ 
TCCTCA ATGTCTTC 1 1G1 1 1 11 1 CCTCTG ATTTTCCAACTCCTTTATCTATTTCCAAATT^ 

GTCCT TCTTT TTAO 1 11 1 LTIXi 1 GACTCITGGAATCGAGTOTACGCAATTAATCTTGGA ClTnTl ' n ' l GATTTCAAT 

CGCGTCTTTGAGATTGATGATTTGTGGACCAACACTTTGGGTGGCTATCTGGCT^ 

ATAAAAACAAGGTAAGGAATTAA 
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4188.1 

ATGAAGAT TCCTCT CTTAACTTTTGCAAGGCATAAA I I 1UU 1 ATCTCTTGCTTACTTTG C 1 1 1 I ' lCITGCITl GCTT 
TATCGTGATGTnTG ATG ACTTA 1 1 ILI 1 1 1 ' I'l G ATATTCATGCGCCCG ATCT AGCTAAATTCG ATGG ACAAGCAAT 
TAAAAATG ACrTATTAAAATCAGCATTAG ATTTTCGTATTCTCCAGTTCAATCTAG G 1 ' IT H ATCAATCATTTATTA 
TTCCAATCATCAl llil 1 1 i GCTAGGTTTTCAATATATTGAGCTG AAAAATAAAO 1 ' I ' 1TACG ATTG AGTATTGG AAG 
AGAAGTGAGTTATCAAGGGTTAAAAAGAAAGTTGACTTTGCAAGTTGC^ 

ACTGTGCTGATAATTGCAATTATAACCTA I ' lTCl ' I ' l GGGA Crn ' r i CTCCTCTTGGATGGAATTCTCTATTTTCTGAT 
GG AA CTGGTTTACAAAGA CTCCTA GATCGAG AG ATAAAAAGCTA 1 T 1 U IT C I ' l ' I 'A C I ' l U l 'GTCCTACTAATCGCTA 
TT TTGATCAA TGCAATCTATTTTTTACAAATAGTTGATTATCT 
ATGTTTCTTTGGCTTGGTTCTATGCTGCnTATAGTGCCT 

AGCTAGCTATGGGGATGTAAGTTTGATGAAACTCTTTACTCCTTATATCCTTTATATTCT 
AAAAATATGAAGATAATGTTTAA 

4188.2 

ATGAAGATAATGTTTAAGAATmAACAATATTTTGCTAAATAGAAAGA ITU If I ' I' ACTACTTCGT AT AGTTCTG AT 
GATGATTTTGATAAACCATCTATTGTCAACAGCGGTTCAAAAGCAGGATGCTGTTATCr 1" ITI CAAGAGAGAATTG 
ATTTCAA rrrrri ' C CTATAATGA CTATT CTGAAGCGAATTTAGAAATCCCCAAACTATTGTTAAACCTT^ 1 
GATGGTAGGATGGCTCTCTGTCATTTrACTrcAAAGTGATTTGGCAGA 

TCAAGCTC C 1 ' 1 ' 1 ' I ' I CGATTATACAAGG AAACG ATTGGTTGTCATTTCT AAA IT 1 11 1 ACTCAAGA l ' 1 ' l O ITIXi TCTG 
0U'^^C■1M , GGT^*AC^^CCTCTAGGAATTCATITCAAAACAGTCGCACIUM , ^C , IT III ACTTGCTCAGTTAATGATGTT 
GTACTTACTACTGTCTTATCTGATAGCACTGATTAGTGCGGGCGCTG <j1 n 1 I C LTlTlTI 'CTCTA l ' lTITr AGCATT 
TGTGGGACAAGAATGGATGATGGATCATATTGTAACAGTGTATTTAGTACTCTTAAGTTTATTAGTTA^ 
GTTAGTCGCTTGGAAGAGAAATTTAAGAAAGGATAA 

4188.5 

ATGGGCAAAGGAGAGATGGCKlAAAGGAGTrATTCGCTTGGAGTTCGACrCAGAAGT 

ACCCTTCAATTGGCAAATGGTAAAACAGCGACTTTCCTAACCCAGTATGATAGCAAGACC1 ICjI 1UX 1 10CAGTAG 

ATAAGGAAGATATCGGACAGGAAATTATTGGTATAGCTAAAGGAAGCATCGAAAGTATGCATAATCTTCCTGTAA 

ATCTAGCAGGTGCCAGAGTTCCTGGCGGAGTAAATGGTAGCAAAGCAGCGGTGCATGAAGTTCCAGAATTTACAG 

GGGGAGTTAATGGTACAGAGCCAGCTGTTCATGAAATCGCAGAGTATAAGGGATCTGATTCGCr^ 

TACAAAAAAAGATTATACTTACAAAGCTCCTCTTXICTCAGCAGGCACTTCCTGA^ 

CCTCCT AGCTTCACT AGG ACT AACAGC 1TILT I'CCTIGGTCTGTTTACGCTAGGG AAAAAG AG AG AACAATAA 

4188.10 

ATGTTT AAj\GTTTT ACAAAAAGTTGG AAAAGC I'l " 11 ATGTT ACCT ATAGCTATACTTCCTGCAG CAGGTCT AC I'l ' I'l 
GGGGATTGGTGGTGCACTTTCAAACCCAACCACGATAGCAACTTATCCAATACTAGACAATACT^ 
ATATTCCAAGTAATGAGCTCTGCAGG AG AGGTTGTATTCAGTAA 1 T 1 U 1 C ACTACTTCTCTGTGTGGGATTATGTA 
TTGGCTTAGCGAAACGAGATAAAGGAACCGCTGCGTrAGG^GGAGTAACTGGTTACTTAGTTATGACTGCAACG^ 
TCAAAGCTTTGGTAAAA CTnT I ATGGCAGAAGG ATCTGCAATTGATACrGGAGTTATTGGAGCATTAGTTGTCGG 
AATAGTTCCCGTATATTTGCACAACCGATATAACAATATTCAATTACCTTCCGCTTTAGGATTC^^ 
CGCTTCGTTCCTATTGTTACATCGTT C t' C 1 ' 1 CT ATCTTG ATTGGCTTTGT dT C II ' 10 1 1 ATTTGGC CACCTT TCCAA 
C AA C I ' I CI ' 1 0 ' lT 1 CTACAGGTGC AT ATATTTCTCAGGCGGGTCCAATTGG AA CT I' l" f CTATATGG A 1 ' 1 1 1 1 AATGAG 
ACTTTCTGGAGCAGTAGGCTTACATCATATAATTTACCCTAT U 1T1 T G GTATACTG AACTTGGTGGTG V I G AAACTG 
TTGCAGGACAAACAGTGGTTGGAGCTCAAAAAATA 11 11 T 1UCTCAATTAGCCG ATTTGGCCCATTCTGGATTATT 
TACAGAAGGAACAAGG1 11 1 1 1 GCAGGTCGTTTCTCAACAATG ATGTTCGGTTTACCGGCTGCCTGT TTAGC GATG 
TACCATAGTCTTCCTAAAAATCGTCGTAAAAAATACGCGGGTTTG1 1 I'l I ' L UGAGTTGCTTTAACATC I'l 1 l"l ATTAC 
CGGTATrACAGAACCAATTGAATTTATGTTTCTATTCGTCAGTCCGGTTCTATATG 

GTGTTAGCT I C IT 1' ATTGCAGACGTCTTAAATATTTCAATAGGAAACACATrrrCAGGAGGTGTAATCGATTT CACT 
TTATTTGGAAT TTTGC AGGGGAACGCTAAGACGAATTGG U 11 CI ' I 'CAGATTCCATTTGGACTTATTTGGAGTGTTTT 
GTATTAT ATTA i'M'l' 1' AGATGGTTCATTACTCAATTCAACGTTCTAACGCCAGGGCGAGGAGAAG AAGTAGATTCT 
AAAGAAATTTCTGAATCCGCAGATTCAACTTCAAATACTGCA 

GAGCCTTGGGTGGATCAAATAATATAGAAGATGTAGATGCTTGTGTGACACGTrTACGTGTA 

TTAATCAAGTTO ATAAAGCAC 1 1 1 I AAAACAAATTGGTGCAGTTGATGTCTTAGAAGTGA-AGCGTGGCATTCAAGC 

AATCTATCGAGCAAAAGCAATCTTATATAAAAATAGTATTAATGAAATTTTAGGTGTAGATGATT^ 

4188.11 

ATOAAATTTAGAAAATTAG CriU I ACAGTACTrGCGGGTGCTGCG O " I I CI I GGTCTTGCTGCTTGTGGCAATTCTG 
GCGGAAGTAAAGATGCTGCCAAATCAGGTGGTGACGGTGCCAAAACAGAAATCACTTGGTGGGCATTCCCAGTAT 
TTACCCAAGAAAAAACTGGTGACGGTCrrTGGAACrrAT^ 

CAGATATAAAAGTGAAATTGGAAACCATCGACTTCAAGTCAGGTCCTGAAAAAATCACAACAGCCATCGAAGCAG 

GAACAGCTCCAGACGTACTCTTTGATGCACCAGGACGTATCATCCAATACGGTAAAAACGGTAAATTGGCTGAGT 

TGAATGACCTCTTCACAGATGAATTrGTTAAAGATGTCAACAATGAAAACATCGTACAAGCAAGTAAAGCTGGAG 

ACAAGGCTTATATGTATCCGATTAGTTCTGCCCCATTCTACATGGCAATGAACAAGAAAATGTTAGAAGATGCT 

AGTAGCAAACCTTGTAAAAGAAGGTrGGACAACTGATGArrrrGAAAAACTATTGAA AGCA CT 

TTACACACCAGGTTCATTGTTCAGTTCTGGTCAAGGGGGAGACCAAGGAACACGTGCCTTTATCT 
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AGCCKjTICTGTAACAGATGAAAAAGTTAGCAAATATACAACT 

gcaactagctggattaaagacaatttgatcaataatggttcacaatttgacgct 

gccaacggtcaaacatcttacacaatcctttgggcaccagctcaaaatggtatccaagct 

agtaaggtagaagtggtagaagtaccattcccatgagacgaaggtaagccagctcttgagtaccttot 

tttgcagtattcaacaataaagacgaca^gaaagtcgctgcatcta^ 

aaggagtggggacctaaagacgtagttcgtacaggtgctttcccagtccgtacttcatttg 

acaaacgcatg gaaa caatcagcggctggactcaatactactcaccatactacaacactattgatggatttgct 

aaatgagaacactttggttcccaatgttgcaatctgtatcaaatgctgacgaaaaacc^ 

ccttcactgaaaaagcgaacgaaacaatcaaaaaagctatgaaacaatag 

4188.12 

ATGCAA TCrAC AGAAAAAAAACC ATTA ACAGCCTTrACTGTTATTTCAACAATCA 1 1 1 1 GC r CTl G I' rGACTGTGC 

TGTTCATCTTTCCATTCTAC^GATTTTGACAGGGGCATTCAAA 

TGGTTCCCTAAAATGCCAACCATGGAAAACTTCCAACAj\CTCATGGTGCAGAACC 

AACTCAGTATTTATCTCATTGGTAACCATGTTCTTAtjl I IG ItKT AACCTCATCTCT AG CAGGTT ATGT ATTGGCT AA 

AAAACGmCTATGCTCAAraCATTCTAmGCTATCTTTATCGCTGCTATGGCGC^ 

TACGATTGGTACGTATCGTCAACTTCATGGGAATCCATGATACTCTCTGGGCAGTTATCTTGCCTTTC 

GCCATTCGGTGTCTTCCTCATG AAACAGTTCAGTG AAAAT ATCCCT ACAG AGTTGCTTG AATCAGCT AAAATCG A C 

GGTTGTGGTGAGATTCGTACCTTCTGGAGTGTAGCCTTCCCGATTGTGAj\ACCAGGGTTTGCAGCC 

TrACCTTCATCAATACTTGGAATGACTACTTCATGCAATTGGTAATGTTGA^ 

CTTGGCGTTGCGACCATGC^GGCTGAAATGCCAACCAACTA^ 

TTCCAATCGTCA(>CrrCITCCTAGTCTTCCAAAAATCCTTCACACAGGGT 

A 

4191.1 

ATGAAAAAAAC rn'rriCri'ACTGGTGITAGGCnU I'l'l'I'GC Cncn 'CCACTCTCT G I ' ll 1 1 GCCATTG ATTTCAAG 

ATAAACTCTTATCAAGGGGATTTGTATATTCATGCAGACAATACGGGAGAGTTTAGACAGAA 

TTGAGGAGGACTTTAAGGGCCAAATCGTGGGACTTGGACGTGCrcGTAAGATGCCTAGCGGGTTTGAG^ 

CTCATCCAAAGATTCAGGCCGCGAAAAACGGTGCAGAACTAGCAGATGTGACTAGCGAAGTAACAGAAGAAGOT 

GATGGTTATACTCTGAGAGTCTATAATCCAGGTCAGGAGGGCGACAT^ 

AAAAATTTACTTTTCCTTTATGATGATATCGCTGAATTAAATTGGCAACCT 

AAAAGTTTGAATTTCATGTAAGGGGAGACAAGGGGGCTGAAAAACTCTT^ 

AGGGAACGATTGAXAAGAGTAACCfTTjATTATACTATCCGTTTAGACAAT^ 

GCATGCCTATTGGCCTCGGACCGATTTTGCTAGCGCTAGCWATCAGCGATTGAAAGTC 

AATA AGATAGAAGACTCGATTGTTAGAGAAAAAGATCAGAGTAAACAACTCGTTACTTGGGTCCTCCCTr 

CTTTCCATCTCCTTGTTATTGAGTGTCTGCTTCTAT 

CAAAAATCATCGTCTCTATGAACCACCAATGGAATTAGAGCCTATGGTTTTATCA^ 

ttggagga^gtgagtcccttggtcaagggagctggaaaattcacctttgatca^ 
atctgatagaccgtgggaatgtctctatcattrcagaaggagatg 
gtttgtcaagctttgagaaagactgccta^atctagctttttcagct 
tgcggattacaaggtatctgatagtctttatcgtagagccaaagtttctg^ 

gcttcaactcaaatl'l 1 (jit 1' it j aag aggtattg aaccag atgcaagaagg agtgagaaaacgagtttccttctgg 
gggctcccagattattatcgtccmaactggtggggaaaaggccttgcaagtggct 
tgcc cctat ttatcggat ttgg ( 11 g i ' i' cttg tacagtitag acgttcatggctatctttacctccctttgccaat a 
cttggttttctagggttag rrrixii ci g it 1 1 ctattattggaagcttcgactagataatcgtgatggtgttctaaa 
tgaagcgggagctgaggtctactatctctggaccagttttg aaaat atgttgcgtgagattxjcacg 
gctgaactggaaagtattgtggtctggaatcgcctctt^ 
gtcattoatga^ggttcatcagarrcaagtggaj^aatccagatatcaatct 
taccttttatcattcaacagcacaaatgagccattatgctagtgtcgcaaatacagcaagcaccta 
tctgga^gtgga^gttctggtggtggcttct 1 1 1 aa 

4191.2 

ATGAAAAA AGTAA GAAAGATATTTCAGAAGGCAGTTGCAGGACTGTGCTGTATATCT(>GTTC 

CGATAGTTGCTTTACK>GAAACGCCTGAAACCAGTCCAGCGATAGGAAAAGTAGTGATTAAGGA 

GAGGAGCGCTTCTACKjAGATGCCGTCTTrGAGTTGAAAAACA^ 

AGGCGCAAjKCAGGAGAAGCGATATTTTCAAACATAAAACCTGGGACATACACCTTGACA 
TTGGTTATAAACCCTCTACTAAACAATGGACTGTTGAAGTTGAGAAGAATC 

AGGTAGAAAATCGAGAAGAGGCTCTATCTGACCAGTATCCACAAACAGGGACTTATCCAGATGTTCAAACACCTT 

ATCAGATTATTAAGGTAGATGGTTCGGAAAAAAACGGACAGCACAAGGCGTTGAATCCGAATCCATATGAACGTG 

TGATTCCAGAACGTACACTTTCAAAGAGAAmATCAAGTGAATAATTTTO 

GACGCTITAGTGGGAAAAC>GTGTATGAj\CAAAAAGATAAGTCTGTGCCGCTGGATOT 

CTCAAATAGTATGAGTAACATTCGAAACA^OAATGCTCGACGTGCGGAAAGAGCTG 

TATTGATAXAATTACATCTGATTCAGAAAATAGGGTAGCGCTTGTGACTTATGCT^ 

GAGTTTAC AGTA GAAAAAGGGGTAGCAGATAAAAACGGAAAGCGATTGAATGATTCTCTTTTITGGAATTATC 
CAGACGAGTTTTACAACCAATACCAAAGATTATAGTTATTTAAAGCTGACTAATGAT 
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TAAAAAATA AGCT ACCTACCGACGCAGAAGACCATC^ 

CTCAGAAACCTTTGATGAAGGCAGATGAGATTTTGACACAACAAGCGAGACAAAATAGTCAAAAAGTCAT^ 
ATATTACGGATG GTGTCC CAACTATGTCGTATCCGATTAATTTTAATCATC 
CAACTAAATGCATTTTTTAGTAAmATCTCCTAATAAAGATGGAATACTATTAAGTC 
5 GTGGAGAACATACAATTGTACGCGGAGATGGGCAAAGTTACCAG 

GTGCTCCTGCAGCTTTCCCAGTTAAACCTGAAAAATATTCTGAAATGAAGC 
TCCAATTAATGGTGGATATATTTGGCTTAATTGGAGAGAGAGTATTCTGGCTT^ 

AAATTACG^ATCATGGTGACCCTACAAGATGGTACTATAACGGGAATATTGCTCCTGATGGGTATGATGTCTTTAC 
GGTAGGTATTGGTATTAACGGAGATCCTGGTACGGATGAAGCAACGGCrACTAGTTTTATGCAAAGTA 
10 AAACCTGAAAACTATACCAATGTTACTGACACGACAAAAATATTGGAACAGTTGAATCGT^ 
GTAACTGAAAAGAAATCAATTGAGAATGGTACGATTACAGATCCGATGGGTC 
ACAGATGGAAGATTTGATCCAGCAG ATTA CACTTTAACTGCAAACCATGGTAGTCGCITG 

GTAGGTGGTCCACAAAATGATGG fGOl lib i'l'AAAAAATGCAAAAGTGCTCTATGATACGACTGAGAAAAGGATT 
CGTGTAACAGGTCTCTACCTTGGAACGGATGAAAAAGTTACGTTG 

1 5 TAAGCAATAAATTTTATG ATACCAATGGTCGAACAACCTTACATCCTAAGGAACTAGAACAGAACACAGTGCGCG 

ACTTCCCGATTCCTAAGATTCGTGATGTGCGGAAGTATCCAGAAATCACAATTTCAAAAGAGAA 
ACATTCAGTTTATTAAGGTCAATAAAAATGATAAAAAACCACrGAGAGGTGCGGTCm 
ATCCGGATTA TCCA GATATTTATGGAGCTATTGATCAAAATCKjCACTATCAAAATGTGAGAAC^^ 
GTAAGTTGACCTTTAAAAATCTGTCAGATGGGAAATATCGATTATTTGAAAAT^ 

20 CGTTCAAAATAAGCCTATCGTTGCCnTCCAAATAGTAAATGGAGAAGTCAGAGATGTGACTTCAATCGTTC 

GATATACCAGCGGGTTACGAGTTTACGAATGATAAGCACTATATTACCAATGAACCTATTCCTCCAAAGAGAGAA 

TATCCTCGAACTGGTGCTATCGGAATGTTGCCATTCrATCTGATAGGTIGCATGATC 

ACACACGGAAACATCCGTAA 

25 

4191.3 

atgaaatcaatcaacaaatttttaacaatgcttgctgccttattact 

cagtttttck:ggctgggacgacaacaacatctgttaccgttc^taaactatttc 
30 aaattgcaaatgagtt agaaa caogtaactatgctcgtaataaa 

ttgccggtgttatgttcgtttggacaaatactaataatgaaattattgatgaaaatggccaaact 

tattcatccacaaacatttaaactctcaggggcaatgccggcaactgcaatga^ 

agctaaatttaacacggcaaatttaccagctgctaagtataaaatttatgaa^ 

gctgaagatggagcaaccttaacaggttctaaagcagttccaarrgaaatrc 
35 g atg cgcatgtgt atcgaaaaaat acag aag caaagccaaaaattg at aaag atttc aaaggt aaagcaaatcca 

gatacaccacgtgtagataaagatacacctgtgaaccaccaagttggagatgttct 

AAAATTCCAG CACTTG CT AATTATGCAAGVGCAAACTGG AGCG atag AATG ACTG AAGGTTTGGCAT^ 

ggtacagtgaaagtaactgttgatgatcttgcacttgaagcaggtgattatgctctaacagaact 
ttgatttgaaattaacagatgctggtttagctaaagtgaatgaccaaaac 

40 attcggcaacattgaatgacaaagcaattgtagaagtaccagaatctaatgatgtaa 

atccagatg\cgggaatactccaaagccgaataagccaaatgaaaacggcgatttgacattgaccaagacatggg 
ttgatgctacaggt gcac caattccggctggagctgaagcaacgttcgatttgcttaatgctcagactggtaaact 
tgtacaaactgtaactttgacaacagacaaaaatacagttactgttaacggarrggataaaaatagkgaatat^ 
attcgttgaacgtagtataaaagggtattcagcagattatcaagaaatcactacagctggagaaat^ 

45 gaactggaaagacgaaaatccaaaaccacttgatccaacagagccaaaagttgttac^ 
caaagttaatgataaagataatcgtttagctggggcagaattnn"aattgcaaatgct 
ttagcacgtaaagcagataaagtgagtcaagaagagaagcagttggttcttacaacaaaggatgct^ 
gcagttgc igci i ataacgctcttactggacaacaacaaactcagcaagaaaaagagaaagttgacaaagctcaa 
gctgcttataatgctgctgtgattgctgccaacaatgcatttc 

so tg aaattagtttctg atgcacaaggtcgctttg aaatt acaggc c i i ci i gcaggt acat attacttag aag aaac 

AAAACAGCCTGCTGGTTATGCATTACTAACTAGCCGTCAG AAATTTGAAGTCACTGCAA C V I ' LTl 'ATTCAGCOACT 

ggacaaggcattgagtatactgctggitcaggtaaagatgacgctacaaaagtagtcaacaaaaaaatcactatc 
ccacaaacgggtggtattggtacaattatctttgctgtagcgggggctgcgattatgggtatt 
^ atgttaaaaacaacaaagatgaggatcaacttgcttaa 

4191.4 

ATGACAATGCAG AAAATGCAG AAAATCATTAGTCGT AT CI ' lUr 1 ' lXr ri 'ATGGCTCTGT G ' r it n ctcttgtatgggg 

tgcacatgcagtccaagcgcaagaagatcacacgttggtcttgcaattggagaactatcaggaggtggttagtc 
attcccatctcgtgatggtcatcggttgcaagtatggaagtttc 
50 attgtaagagacttgcattcgtgggatgagaataaal 1 1 IU I LI I icaaaaagacttcgtttgagatgaccttcc 
ttgagaatcagattgaagtatctcatattccaaatggtctttactatgttc 

ttctt atccagctg aa iticl'l'l'l ig aaatg ac\g atcaaacckttag agcctttggtcattgtagcg aaaaaaaca 

GATACAATCACAACAAAGGT GAAGC TOATAAAGGTGGATCAAGACCACA^ 

ttggtatcagtagcaagagatgtttctgaaaaagaggttcccttgattggagaataccgttacagt^ 
05 AAGTAGGGAGAACTCTCTATACTGATAAAAATGGAGAGA 1 1 1 1 ioiu acaaatcttcctcttgggaactatcgttt 
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CMCGACGTGGAGCCACTCGCAGGCTATGCTGTrACGACCCTGCATACGCATCTCCAGCrGGTAGATCATC^CCT 
GGTGACGATTACGGTTGTCAATCAGAAATTACCACGTGGCAATGTTGACTTTATGAACGTGGATGGTCTC 
TACCTCTCTTCAAGGGGCAATGTTCAAAGTCATCAAAGAAGAAAGCGGACACTATACTCC^ 
AAGGAAGTAGTTGTAACATCAGGGAAAGATGGTCGTTTCCGAGTGGAAGGTCTAGAGTATGGGACATACT 
J TGGGAGCTCCAAGCrCCAACTGGTTATGTTCAATTAACATCGCCTGTTTCCT^ 
AGGAACTGGTAACAGTGGTTAAAAATAACMGCGACCACGCATC 
TATATCTTtjATGCTTGTTGCCATTTTGTTGTTTGGTAG 

4191.5 

1 0 ATGAGCCACATATACTTATCTATTTTCACAAGTCTCTTGCTG 

AATATTTACGCATCGGTATGGAAGCAGCATATGCrCCCTTTAACTGGACCCAGGATGATGATAGCAACGGAGCrc 

TCAAAATCG ATGG GACCAATCAGTATGCCAACGGATACGATGTTCAAATCGCCAAGAAAATCGCTAAGGACTTAG 

GTAAAGAACCTTTGGTTGTTAAAACXAAGTGGGAAGGTCTAGTCCCTGCCCTTAC^ 

TATCGCAGGTATGAGTCCAACTGCAGAACGCAAACAAGAAATTGCCnTTTC^ 
1 J AGTTTTGCTTGTCAAAAAAGATTCTGCCTACGCAAGTGCT 

TCTCAACAAGGGGTCTACCTTTATAACTTTjATTGCACAAATCCCAGCT 

TTCGCTCAAATGCGACAAGCTCTTGAGGCnxrrGTCATTGATGCrr 

CTGAAGCTGCGAACrCTAAGTTCAAGATGATrcAAGTAGAACCTGGTTrCAAAA 

TCGCTATCGGGCTTCGTAAAAATGACAATCGTATTAGCCAAATCAATGC 
ZU ACCAAGTTGCCTTGATGGATCGTATGATCAAGGAACAACCTGCCGAAGCTACAACAACTGAAGA 

GTTTCTTTAGCCAAGTTGCTAAAATTCTTTCT^ 

ATCTCTATCGTCGGAACCATCATAGGTCTCATTATTGGA 

AAACAAAGTCATTTACGGCCTACAAAAACTAGTCGGCTGGGTTCTCAATGTCTA 

CCAATGATTGTTCAATCGATGGTTATCTA CTAT GGAACTGCCCAAGCTTTCGGGATCAACC 
25 TTGACCGTACACTGGCTGCTATCTTCATCGTnCAATCAATACCGCrrGCCTACATGACT^ 

TA TCCTA GCAGTTGACAAGGGACAATTTGAAGCTGCGACTGCTCTTGGTATGAC 

GATTGTCCTACCTCAGGTAGTCCGTAACATCCTACXTGCAACTGGTAATG 

TCTGTATTGAACGTTATCTCrGTTCrrCGAACTrrArrrCTCAGGAAATACCOT 

CCAGACATTTACAATCATCGCCGTGATTTACTTTTjTCCTCACCTTCACCGT 
jyj GCAGAATGGACATGGATACCTACACTACAGGTGCrAACCAAATGCAAACGGAGGAnTGAAATAA 

4191.6 

atgacacaagcaatccttgaaattaaacacctcaaaaaatcctatggacaaaacgaagtgct 
ctcactgtccacaagggagaggtcatctctatcatcggaagcrctggaagcggaaaatcgaccttcctacgctcc 

3 j attaacctacttgaaacaccaactgatggacaaattictttat 

ctcacgcaataccgtgaaaagttggggatggttttccaatccmaacctcm 
acacaatcgtcgctcagacaactgtcctaaaacgcgaacgcacagaagctgaaaagattgccaaagaaaacctg 
gaaaaggtcggcatgggagaacgctactggcaagccaaaccaaaacaactctcaggtg^ 
ggccatcgctcgtgccctctccatgaatccggacgctattctctttgatgaaccaacatcagctcrcgatccag 

w atgcttggagaagtcctcaaaatcatgcaggacctcgctcaggaaggcttgaccatgattgtcgtaac 
atggaatttgcccgtgatgtctctcaccgtgttatctt^ 

aagacctcttcaccaatcctaaagaagaccgaacaaaagagttccttcaacgctatctcaaataa 

4192.3 

« ATGAAAAAGTATCAACTTCTATTCAAAATAAGTGCAGTCTTCTCTTACTTATT^ 
GACGCTTATCGTCCAAAACTATTGGCAATTTTCTTCT 

TAii tilt A TTGO AGTC ATG A TTGTGG M"C ITU l'l AAGACAGGCCATGGTTATCTCTTCCGCATTCCAAGAAAAAA 
ATGGCTTTGGTATTCGATTTTGACAGTATTAGTGCTAGTGTTCCAGATCT C 1' IT [ AACGTTCAGACAGCTAAACATO 
TTCAGTG\ACTGCGGAAGGTTGGGCrGTATTGATTTXrrTATACT 

AGCCCTGTTCTrrCTGGTTCCACTG ATGG AAGAATTGATTTATAG AGG ATTACTGCAACATG C l ' l TC I 1 1 AAGCATT 

CGCGATTTGGTCrrGATTTGCni'MCCTT^^ 

ATATCTTCCTCTTrcCAACACTTGOAATCATCTTTGCrGGTTTGAC 

GCGGTGCATGTGATCAATAATATTGTAGCGACCTTCCCU 1 1 1 1 IGCrCACTTTTCTACATAGGGTCTTGGGGTAA 
5 5 4193.1 

atgaacaagaaacaatggctaggtcttggcctagttgcagtggcagcagt^ 

tcttctcgtaacgcagcttcatcttctgatgtgaagacaaaagca 

acaaatcattgaaccaatgagcttgggaaggtttgcaggctt^ 
Af\ gt ttca cttacttcc aatc aacaagtgaagctgactacgctaacaacttgcaacaagcggctggaagt^ 
oU aatcttcggtgttggttttgcccttaataatgcagttaaagatgcagcaaaagaac^ 

ttgattgatgatgtgattaaagaccaaaagaatgttgcgagcgtaactt^ 

c aggtgtgg ctg c ag caaaaacaactaag acaaaacaagttg gttttgt aggtggtatcg aatctgaagtt atct 
ctcgttttgaagcaggattcaaggctgg ro 1 1 gcgtcagtag acccatctatcaaagtccaagttg actacgctgg 
ttcatttggtgatgcggctaaaggtaaaacaattgcagccgcacaatacgcagccggtgcagatattgttt^ 
05 agtagctggtggtacaggtgcagg f U I L'l i igcagaggcaaaatctctcaacgaaagccgtcctgaaaatgaaaa 
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ACTTT TGGCnTATCGGT GTTGA TCGTGACCAAGAAGCAGAACGTAAATACACTTCTAAA 
CrTTGTTCTTCrrATCTACTrrGAAACAAGTTGGTACAACTGTAAAAGA 

TTCCCTGGCGGTCAAGTGATCGmACTCATTGAAGGATAAAGGGGTTGACrTGGCAGTAACAAA 

AAGGTAAAAAAGCTGTCGAAGATCCAAAAGCTAAAATCCTrcATGGAAGCGTAAAAGT 

TCCTGAAAAATAA 

4193.3 

ATGTCTAAAAAATTACAACAAATTTCGGTTCCCTTGATTTCTGTATTCCTAGGAATTT^ 

GTGG ATCTTCGGTT ATG ATGCTATTTGGGGCT ACG AAG AATTGTTCTAT ACAGC C I ' 1 ' rG GCAGTCTGCGTCCC ATT 
GGAGAAATaTCCGTGCTATGGG TCCT CTGGTCTTGATTGGTOT 1 
TAACGTCGGACTTCCTGGTCAGGC IT 1GGCAGOTTGG ATTCTCA G TOG 1TGGTTTGCCCTGTCGCATCCAGATATG 
CCCCGTCCCTTGATGATTCTAGCAACCATCGTGATTGCCTTGATTGCTGGTGGGATTGTCTC 
TTCTTAGGGCCTATCTAGGGACGTCAGAGGTTATTGTAACCATCATCATGAACTACATTGTCTTCT 
TGCCTrTATCCATGCTrrCCCTAAAGACTTCATGCAAAGTACAGATTCGACCATTCGTGTrc 

tatcagacaccttggttggctgagttgactgctaactcacggatgaatattgcta v 1T1 Cl ' l ' l GCCATCATTGCCGT 

TGCAGTTATTTGGTTCATGCTCAAGAAAACAACTCTTGGTT7TGAAA 

CAGAATATCKTrGGTATTrCTGCCAAGCGGACTATTATCCTATCTATGATTATT^ 

GGAGCTGTTGAAGGTTTGGGAACCTTCG\GAACGTCTATGTTCAAGGTTCGTCATTAGCT 

TGGCGGTTAGTTTGCTTGCGGCCAACTCACCAATTGGTATACTCTTTGCAGCCTTCCT 

GGGGCTCCTGGTATGAATGCGGCGCAGGTACCATCTGAGCnXTrCAGCATTGTAACAGCGTCTATTA 

TGVGTGTTCATTACCTTATCGAACGCTrnrrCAAACCGAAAAAACAAGTTAAAGG^ 

4194.1 

ATGGGAGTGAAAAAGAAACTAAAGTTGACTAGTTTGCTAGGACTGTCTCTGTTAATCATGACAGCC 
AATGGGGTAACTAGCGATATTACAGCCG AATCGGCTGATTrTTGGAGTAAATTGGTTTAL'rrC l'l" 1 GCGGAAATCA 
TTCGCTTTTTATCGTTTGATArTAGTATCGGAGTGGGGATTATTCTCTT^ 
CCAGTCTTTCAGGTGCAAATGGTGGCTTCTAGGAAAATGCACKjAAGCTGAGCCACG 

CAATATCCAGGTCGAGATATGGAAAGCAGAACCAAACTAGAGCAGGAAATGCGTAAAGTATTTAAAGAAATGGG 
TGTCAGACAGTCAGACTCTCTTTXjGCCGATTTTGATTCAGATG 

GAGTTGALTrri'I AAAGACAGGTCA l'l ICl'I ATGGATTAACCTTGGTAGTGTGGATACAACCCriGI 1CI ICCGATT 
TTAGCAGCAGTATTCACCI 1 1 M AAGTACTTGGTTGTCGVACAAAGCITTGTCTGAGCGAAATGGCGCrACGACTG 
CGATGATGTATGGGATTCCAGTCTTGA l'l I'll ATCTTTGCAGTTTATGCGCCAGGTGGAGTCGCCCTATACTGGAC 
AGTCTCTAATGCTTATCAAGTCTTGGAAACCTA l ' 1 ' l C riXi AATAATCCATTCAAG ATTATCGCAGAGCGCGAGGCC 
CTAGTACAGGC^CAAAAAGATTTGGAAAATAGAAAAAGAAAAGCCAAGAAAAAGGCTC^ 

4194.4 

ATGCrrTATCGATCCATTTGCTATCAACGAACTAGACTATTACTTACrmC^ 
CATACACAGCTGCAGCAATTCTCAATAATCCTAAGTTAGAGCATGTTAACriTTATC 

AATCTGGGAAGGATGGGaiXi nrCAAAAGAACGTATCATC G TI G I'I'AAACCAGGTGACACTATCGAATTAAAAGA 

TATGAAGATTCATGCAGTAG AATCATTTGACCGTAC I IGCl'l GGT AACTCTCCCAGTG AACGGTGCTG ATG AG ACA 

GGCGGTGAACTTGCTGGCrTGGCTGTrACAGATGAAGAAATGGCTCAAAAGGCTtr^ 

CAGGTGGAACCATCTATCATGGTGCAGATTCTCACTTCTCAAACTATTTTGCAAAACATGGTAAAG 

TGAT G 11GU1 1 GAATAACTATGGTGAAAATCCGGTAGGTATCCAAGACAAAATGACATCTATCGACCTTCTTCGT 

ATGGCAGAAAATCTGCGTACCAAAGTCATTATCCCAGTTCACTATGATATCTGGTCTAACTTC^ 

ATGAGATTCTAGAACTTTGGAAAATGCGAAAAGATCGCTTGCAATACGATTTCCATCCATTTATCTGGG 

CCGTAAGTACACTTATCCTCAAG ATCAACACTTAGTAG AATACCATCATCCACGTGG' r I' 1 ' I GATGATTGTTTTGAA 

CAAGACTCTAACATTCAATTTAAAGC 111 UCTATAA 



4196.2 

ATGTTCCTTTCAGGCTXXinGTCTAOl I I I GCTAATACTTATATCCATGATTTACTGGGGCTn L' 1 I 1 1 CCC AGATAG 
TC CA 1 ' l ' l ' 1 ' F A AATGC L 1 1 1 U AAAGTCCTATTGCGGCTCCTTTGGTAGAAC AACCCTTG AAATTATTGTCACT 
TT till'ri'G G C'ri'lU ATTCCTGTGCGAAAATTAAAATCTTTG 111' 11 ACTTGGAATTGCTTCCGCTTTG GGAT TCCAA 
ATGATTAAGGATATTGGTTATATTCGTACGGATTTGCCAGAGGGC1 1 1 G ACTTTACTATTTCGCG AATT TTAG AGC 
GTATCATCTCAGG AATTGCCTCTCACTGG AC 11 1 11 1 1 CAGGTCTAGCTGTAGTAGGTGTTTACTTGCTTTACAGAGCC 
TATAAAGGACAGAAGGTTGGCAAGAAACAGGGCCTTA 1 1 11 1 CT AGGTTTAGCCTTGGG AACTCACTTCTr GTTTA 
ACTCTCU 1 1 1 0 IGGAGTTGGAAACAGAGTTGCCTTTAGCGATTCCAGTGGTTACGGCTATTGCTCTCTATGG l'l 11 
TATCATGCTTA 1 TGCl ' l 1 ' G T T G AGAAACACAATGAGTTGATG ACCTAG 

4197.1 

atg aaggtgg aacc acgttgcg acgtcctttcg agg atgtcgc a i'l'l' t'l 11 attagg at actaattatgg agttg c 

aagaattagtggagcgcagttcggcaatccgacaagcttatcacgaactcgaagttaagcat 

gg acggtag aag aag acc i (j i i ggcttt atctaatg atattgg aaatitccaacg actggtg atgacaaagcaag 
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GACG CTACTA TGATGAAACACCCTACACXCTGGAACAAAAACTTTC^ 

CTC AACG TTTGGATATAGACATTCTGACGGAAATCGAAAACTTCCTCTCTGATAAAGAAAAGCAATTGAACg 
GGACTTGGAAGTAG 

4197.4 

ATGCTTGATTCGAA ACAA TT TTTTCTA CCCTATCTCCGCTCCCCT 1 1 L I 1T GGC 

Ai i 1 1 1 1 GTCTT ACTCTT TCAGTTTTTATTTGCCAGTCTAGGAATTTACTTCCT t lCMCl 1GTG7TGCTTT 

GTAACCATATTATTTTTCACTTGGGACATATTGGTGGAAACGCAGGTCT 

GGG AAGCCAAGTCTCCTTTGGAAATAGCTTTAGCAGAAAAATTAGAAGCGCGT 

GGTCAAAAGCAGAAAGAAAACTGACGGATTTGCTGGATTACTATACCTTGTXXOT 

TTGCACCC^CTCAACTCTTAGTTG CAGA ACTGGTCGACCGCCAACT 

AAATCGACTCCTATACCAACCTAGTTTTACAGTACCTGCGTTTAGAAAGTTTC 

GGTTCAAATTGAGGACTTGOTCAAGGAAATAATTCCTAAATATOCT L \ i 1 111 1 ' 1 ATTCAAAAAGGCTTAAATGTC 

AATCTACATGACCTTGATAAAGAAATCGTGACGGATAAAAAGTGGCTXjCTAGTGGTTATTG 

AACAGTCTC>AGTACACCAAGGAAGGTGGTCTGGAGATTTATATCKjATGACCAAGAGCTTTGTATC^ 

GGAATCGGGATAAAAAACA GTGAT GTCCTCCGAGTATTTGAACGTGGC^ 

AGCACrrCCTCTGGACTTGGCCTTTATCTATCTAAGAAAATTTCTGAA 

TGAGCTCGGAAAAGGAACGACAGTGCGGATTCAGTTTGCTCAAGTGAACTTAGTCCTTGAGTAA 
4211.2 

ATGGAACTTAATACACACAATGCTGAAATCTTGCTCAGTGCAGCTAATAAGTCCCACTATCCGCAGG 
CCAGAGATTGCCCTAGCAGGGCGTTCAAATGTTGGTAAATCCAGCTTTATCAACACTATGT^ 
TCGCCCGTACATCAGGAAAACCTGGTAAAACCCAGCTCCTGAA tl I 1 ['1 1 AACATTGATGACAAGATGCGCTTTGT 
GGATGTGCCTGGTTATGGCTATGCTCGTGTTTCTAAAAAGGAACGTGAAAAGTGGGGGTGCATGATTGAGGAGTA 
CTTAACGACTCGGGAAAATCTCCGTGCGGTTGTCAGTCTAGTTGACCTTCGTCATGACCCGTCAGCAGATGATGTG 
CAGATGTACGAATTTCTCAAGTATTATGAGATTCCAGTCATCATTGTGGCGACCAAGGCGGACAAGATTCCTCGTG 
GT AAATGG AACAAGC ATG AATCAG C AATCAAAAAG AAATTAAACTTTCACCCG AGTG ACG ATTTC ATCCTCTTTTC 
ATCTCTCAGTAAGGCAGGGATGGATGAGGCTTGGGATGCAATCTTAGAAAAATTGTGA 

4211.3 

ATGACAAAGAAACAACTTCACTTGGTGATTGTGACAGGGATGAGTGCCGCAGGGAAAACT 

TTCG AGG ATCTAGGTTATTTCACCATTG ATAATATGCCGCCAGCTCTCTTGCCTAA Ci l ' l T l ' l GCAGCTCGTTG AAAT 

TAAGG AAGACAATCCTAAGTTGGCCTTGGTAGTGGATATGCGTAGCCO 1 1 CM 1C1 1 1 1 CAGAGATTCAAGCTGTT 

TTGGATGAGTTGGAAAATCAAG ATGGTTTGGATTTCAAAATCCT C 1 1 1 1 1 GGATGCGGCTGATAAGGAATTGGTCG 

CTCGTTACAAGGAAACCAGACGGAGTCACCCACTAGCAGCAGACGGTCGTATTTTAGATGGAATCAAGTTC 

GTGAACTCTTGGCACOTrGAAAAATATGAGCCAAAATCnXjCT 

CAAAACCmGCAGAGCAGTTTTCAGACCAAGAACAAGCCCAGTCTITCCOTATra 

AAGTATGGAATCCCGATTGATGCGGACTTGGTCTTTGATGTCC Ci nTCTl 'GCCAAATCCCTATTATTTACCAGAACT 
GAGAAACCAAACGGGTGTGGATGAACCTGTTTATGATTATGTCATGAACCATCCTGAGTC 
CATTTATTGGCCTTGATTGAGCCGATTCTGCCAAGTTACCAAAAGGAAGCT 1 1 ' 1 GACCATTGCCATGG 

GATGTACGGGTGGACAACACCGTAGTGTGGCATTTGCTAAACGCnW 
TAATGAAGGGCATCGCGACAAAGACCGCAGAAAGGAAACGGTAAACCGTTCATGA 

4211.4 

ATGAGAAAACCAAAGATAACGGTGATTGGTGGAGGGACTGGAAGTTXCGTC 

GATGTGGAAATCGCAGCTATCGTGACGGTGGCAGATGATG G t G O II L'l' I CA GGTGAACTCCG AAAAAATATGCAA 

CAGTTG ACACCGCCAGGTGATCTTCGTAATGTCCTTGTGGCCATGTCGGATATGCCTAA Cj l ' 1 ' l ' I ATGAGAAGGTCT 

TTCAGTATCGGTTCTCTGAGGATGCCGGAGCCTTTGCrGGCCATCCATTGGGAAATCrCATCATTGCT 

AGAAATGCAGGGTTCAACCTATAATCCCATGCAGTTATTGAGCAAATTTTTCCATACAACAGGG 

TCCAGTGACCATCC 1' I " 1 0 ACCCTTCATGCAGTCnTCAGG ATGGGAGVGAAGTCGCTGG AG AG AGTCATATTGTAG 

ACCATCGAGGCATAATTCACAATGTCTATGTGACCAATGCCCTAAACCATGATAC^ 

TAGTGCAGACCATCCTTGAAAGTGACATGA 

CGTGATTAAGGAAATTGGGCGGGCTCTTTTGGAAACCAAGGCAGAAATTC 

ACGTGGGGAGACGGAACACTTTACAGATAGCGACCACGTGGAAGTCTTGCATOT 1 1 1ATC 

GACACTGTCTTGGTGAATATTGAAAAAGTXjCCTCAGGAATACATGAATTCCAACC G I'll 1 GATGAATACTTAGTGC 
AAGTGGAACACGA 1 1 1 I U i AGGTCTTTGTAAGCAAGTTTCGCGCGTG ATTTCATCTAACTTCCTTCGTCTGG AAAA 
TGGCGGTGCCTTCCACGATGGAGATTTGATTGTGGACGAGTTGATGCGCATTATAC^ 

4213.1 

atgaaaaatttgataaagttgctaataattagattgattgttaact 
tgtcgcacgttagcaataattattcttcgagc 

ttactaatl i mit lg gacg\g ttattgacagagtaaatccgcaaaaaattcttataatatcaattttggtrcaatt 

agcagtggctgt aata 1 1 i l'l attattattaaaccaaatatcattttgggtgataatgagtctagtgtttatttcag 

taatggctagctccataagttacgtgatagaagatctttntgattcctcaagtggtagaat 

tgcaaattctl 1 1 1 1 1 agtatttcgtataaagtattag attcta 1 1 ml aattcattcgcatca nn 1 acaggtgg 
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CAGTACGATTTATTTTATTGGTTAAGATAGATATAGGCATA I'lTll ACTrGCTCTATTTATATTXnTGTTGTTAAAA 
TTTAGAACTAGCAATGCGAATATAGAAAACTTCTCTTTCAAATATTACAAGAGAGAAGTGT^ 
TTT ATTTTA AATA ATAAAT TATTATTTAAAACCAGTATITCTTTAACGCTTAT I inn 1ATTCATTTCAGACA 

gtagttgtaccg a i ' n ' n tct attcg atattttg atggtccg a n 1 n 1 1 i atggta mi rrn aactattg ciooi ii g 

ggtggtatattgggaaatatgctagcgccaatcgtaataaaatatttaaaatcg 

1 r it 1' n o aacggctcaagttggttagtag caatt gttataaaag actatactttatgvcttattrra it ii ' 1 lli 1 1 1 

GTTTTATGTCTAAAGGAGTCTTCAATATTATTTTT.AATTCGTTOT 

AGGGTAAAT ACT ACCATTG ATTCTATTA 1 T I LI ' M ' IX3G AATGCC AATTGGTAGTTT AGTTGCAGG AACGCTTATTG A 
TTTGAATATTG AATTAGTGTTAATTGCTATTAGCATACCTTA 1 1 1 1 110111 lClT ATA ITlTn ATACOCATAATGO 
ATTGAAAGAATTTAGTATATATTAG 

4213.2 

ATGATGTCTAACAAAAATAAGGAAATTCTGATTTTTGCGATTCTCTATACAGTCCT 

attgctggc ncrr r aatgccatctgccattgcaaattatl' nor i t atgt agtttt ag ctct at atggctcc r i c r i 

CnTCAAGGATAGATTGATCCAACAATGGAAGGAGATTAGAAAGACTAAAAGAAAATTLl 1 CI I I GG AGTCTT AAC 
AGGATGCCTCTTTCTCATTCTGATCACTGTTGTCTTTGAArTTGTATCAGAGAT 
ATGGACAAGGTCTAAATCAGTCTAATAT TCAAA GTACCTTTCAAGAACAACCACTACTGATAGCTG 
TGTCATTGGACCTCTGCrrAGAAGAATTATTTTTCCGTCAGCrrCTrATTC 

TACTAAGCATTATTCTGGTAGG AC 1 1 li 1 1 1 1 1 GCTCTG ACTCATATGCACAGTTTGGCTCT ATCAG AGTGG ATTGGT 
GCAGTTGGTTACTT AGGTGG AGGCCTTGCL 1 1 1 1 CTATTATTTATGTGAAAGAAAAAG AG AATATCTACTATCCCC 
TACTTGTrCACATGTTAAGCAACAGCCTCTCCTTAATCATTTTAGCTATG^GTATAGTAAAATGA 



4224.1 

TTGAAAAAGCCAATTATCGAATTCAAAAACGTCTCTAAAGTTTTTGAAGACAGCA^ 
ATCAACTTTGAGTTGGAAGAAGGGAAATTCTACACCCTTCTAGGTCC^ 

ACATTATTCCAGGTTTACTGGATGCGACGACAGGAGATATCATGCTAGACGGTGTTCGTATCAATGATATr^ 
CAACAACCGCG ACGTACATACCGTCTTCCAATCCTATGC Cri ' liT 1 CCG\CATATGAATGTGTTTG AAAATGTTGCC 
TTTCCACTTCGCTTGCGTAAAATTGATAAGAAAGAAATCGAGCAGCGTGTAGCGGAA 

TGC AAGGTT ATG AAAAACGTTCCATCCGCAAAC 11 1 1 C llrG AGG ACAACGTCAGCGTGTGGCCATCGCCCGTGCTA 
TCATCAACCAACCCCGTGTGGTCTTGTTGGACGAGCCTrTATCAGCGCTGGACTTGAAACT 
GTACGAATTGCGTGAATTACAACAACGATTGGGCATTAC Crritj T Cl 1 101 CACTCACGATCAGGAAGAAGCTCTT 
GCCATGAGTGACTGGATITTCGTTATGAATGATGGCGAGATTGTCCAGTCTGGAACCCCTGTGGACAT^ 
AGCCAATCAACCAC l' II U 11 OCCACCTTTATCGGGGAGTCAAACATCTTGCCAGGTACCATGATTGAGGACTACTT 
GGTCGAATTTAACGGCAAACGC 1 1 1 GAAGCGGTTGATGGTGGG ATGAAGCCAAATGAACCTGTTGAGGTCGTTAT 
TCGTCCAG AGG ACTTGCGCATTACCL' 1 1 ICC IG AAG AAGGCAAGCTCCAAGTTAAGGTCG ATACCCAGCl'l I ICCGT 
GGAGTrCATTATGAAATTATCCCCTATGACGAACTTGCAAATGAATGGATGATC 

tgggtgaggaaatcggtctggactttgaaccagaagacatccacatcatgcgtct 

tcgatgctcgtattgaggagtacgtagaaatcgaagagcaagaagcaggtttgatcaatgcaatcgaggaggaa^ 

gagatgaagaaaacaagctctaa 

4252.1 

ATGAAATCAATG AG AATCTTA l" rrri'G'lTAGCTTTAATTCAAATCAGTTrGAGTAGCTGTTTCCTATGGAAGG AAT 

gcatctt gtcc tttaaacaaagtacagc r rrrr i catcggaagcatggttttcgtttcagg aatctgtgctggagt 

AAATTATCTTTATACCCGTAAGCAAGAAGTCCATAGTGTCCTAGCCAGTAAGAAGTCGGTGAAGCrrr IT l'ACAGT 
ATGTTACTCTTAATTAA lTlti 11 AGGAGCT G I I C 11 0 1 ' ITI GTCAG ATAA C I TCj IT CATCAAAAATACGCTGC AGCA 
AG AATT AGTTG A C l'l T I ' 1 ' ATTGCCATC C lll'I 11 Tl CCTATTTGGGCT AG ATTTGCTG A 1' I'll' 1 1 1 ACCCTTG AAAAA 
ATACGTGCGCG A 1 11 1CI 1 UCTATGCTGGACAG AAAAAAGACAGTGTTCGTG ACTATTTTAGCAAC^ 
TTAAGAAATCCAATG ACCATTGTCTCA C 1 1 CI G ATTTATATTGG ACTGGG C I ' l G ' 11 1 1 1 1 GCAGCCTATCTTGTCCC 
AAATTCGG'ITAAGAAGGAAGTTTCC 111 1 ATGGTCATATTTrCCGAGATCTTCTATTGGTCATTGTTACGCTCATTT 
TCTTTTAG 

4252.2 

ATOCTTAAAAAAATTATTGGAATGGTtiCTAGCTTTACTTTCTGTAACTGTACT I GCTTATAC 

TATTTATCAACAAGGGACAGAAACCTTAGCTAAAACCTATAAAAAAATCGCrrGAAGAAAC 
GACTGAACCTCTAACCATTCTGTrAATGGGAGTCGACACCGGAAATGTKjAACGAACT 
AAGTGATAGCATGATCTTGATGACAGTCAATCCTAAAACGAAAAAAACAACAATGATGAGT^ 

tctgacgcgcattgaatcagggaatggtcaggctcatgaagcgaaactgaact^ 

agagcttgctatagaaaccattcaaaaaatgatgaatatccatattgatcgctatct 

attgcaaaaactagtgg atgcagtagg aggtattacagtca at aatatcctaggtttcccaa im ' 1 v f atcagtc 

caagaagaatttaatactatttctatcggtcnggggagcaacatattgggggagaagaagccctagtct 

cgaatgccrttaccaagatcctgagggggattatgcrrcgtcaaaaacgtcaacgtgaacrt^ 

gaaaaagctctcagtttaaatagcattcgtcattatcaagagattctaaaagctttgagtgacaa 
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A ATAT TG A 1 1 1 u t CTGCAAAAAGTATCCCTAACTTCCTAGGCTATAAAGATTCATTTAAAACCATrGAAACTCAC^ 
AGTTGCAGGGTGAAGGAGAGATACTTCAAGGTCnTrCTTACCAGArrGmCGAGAGCACATATGTO 
AAATCTACTCCGACGTTCTTTGGGACAAGAAGAAGTTACTCAGCTTGAAACCAATGCGGTTT^ 
^ TTTGGCAGAGCACCTGTTGGTGATGAAGATAATTAA 

4236.2 

ATCAAAAA ACAAG CCTATGTCAT TATTGCT CrCACCTCCTTCCTA i 1 1G1U 1 inni CTCCCACAGCTTGCTCCA 
AATACTTGATTTTGACTCGTCTATCTTTTTG CACGA TGTCGAAAAAACAGAAAA A M ' 1 0 1 C I 11 ITATTGTTGGTTT 
TCAGCATGTCC ATGA CCTGTCTCTTAGCCCTGTTTTGGCGAGGGATCGAAGAGCTTTCT 

1U TAATCTCAAGCCTTTATT AGCAG GGCAAGAAGTGCTTCAGOTTGCAGATCCAGATTTGGATC 
TTATCAGGTAAACnTAACCTTTTGACAGAGGCTCTTCAAAAAGCTGAAAATCAGAGCCT^ 
ATCA TCGA GAAGCAACGGAAGCGAATTGCTCGGGATTTGCACGATACAGTCAGTCAGGAGTTXjITro 
ATGATTTTATCGGGTATCAGTCAGCAGGCTTTGAAATTGGATA 
ACAGCTATTTTAG.AAACAGCCCAGAAGGATTTGCGGGrm 

1 5 AGAGCTTGATAGAAGGGATTCAAATTCTTTTAAAAGAGCTTGAGGACAAGAGTG 

AGAATATGACGAAATTGCCTAAGAAAATCGAGGAGCATATCTTCCGTATCCTGCAAGAGTTGATTAGCAATAC 

TCCGCCATGCCCAGGCATCTTGCCTAGATGTCTACCTCTATCAGACAGATGTTCAATTGCAACTGAAGCT 

CAATCGGATTXjGTTTCCAGTTAGGGAGCTTAGACGACTTGAGTTATGGACTGCGAA^ 

. . agatatggctggaacagttcaactcttgacagctcccaagcaagggctggcggttcatatc 
20 gataacgaatga 

4263.1 

atgattgtttccattatttctcaaggatttgtctgggctattctaggtctgggaatctt^ 
aaactttccagatatgacgacagaaggttccttccctcttgggggagctgttgctct 

25 gtgaaccca1 1'l'i t agcg acacttgttgctgta ggagc ag o 1 1011 \ ggctgg aatggcagcaggc l ' i ' l c 1 ' 1 ' 1 ' ata 
caaaagggaagatcccaacotgctctcagggattttggtgatga c1 id 1 g 1 cactcaatcatcctcttg attat 
ggg acgtgcg aatttagccctgcttgg aaccaagcaaattcagg atgttttgc c i ' ltru attcgg atttgaatcaa 
ctcttg ac aggtctcatctttgtg agtattgttattgctctcatgct c ri ' l ' i ' i ' c ' 1 ' 1 'gg acactaaactcggacaagc 
ctatattgctacaggggataatcctgatatggctagaagtttcgggattcatactggacgcatggagctcatgggc 

30 ttggtcttatcaaatgctgtgattccccttgcaggtgccctcattgctcagcaagaag 

gaggg atcg gggttatcgttgtggggcttgcaagtttgattattggagaagttattttc^ 

AGAGCGTTTGGTTACTATCCrrrCrrAGGTTCTATCGCTTATCAATTTTTA 

ATACAAGTTACCTTCGTITATACAGTGCCTTGATTTTAGCAGTCTGCCTCATGAT^ 

CTTGAAAGGAGCCAAGTTAAGCAAATGA 

35 

4346.1 

ATGAAAAAAATGAAAGTTTGGTCTACTGTACTTGCAACGGGAGTTGCTCTTACTAC^ 
GTTCAAATTCTACGACrGCTTCTTCATCTGAAGAAAAAGCItiATAAAAGTCAAGAATTAGTTATCT 
AGTCTCAAATGGTCGTGCTGATTGGTTAACTGCTAAAGCAAAAGAAG C 1 go r rn AATATAAAAATGGTTGATATC 
40 GCTGCCGCTCAATTAGCAC^CCGTGTTATTGCTGAGAAGAATAATGCAGTTGCAGATATGGTA 
CTGTTGATTCAAATAAAATTAGAGATCAAAAATTACTAGTAC^GTACAAGCCT 

AATCTTTATCAGATAAAGATAATTATTATAATCCTGTGATTGTTCAACCATTA O 1 i 1 1 AATTGGGGCGCCTG ATGTA 
AAAGAAATGCCTAAAGATTGGACTGAATTAGGTAGTAAGTATAAAGGTAAATATTCAATTTCTGGTC^ 
GGTACAGGACGGGCAATTCTAGCAAGTATCTTAGTTCGATACCTrGATGATAAAGGTGAATTAGUiti HI CCGAAA 
45 AAGGTTGGGAAGTAGCAAAAGAATATTTGAAAAATGCATACACTCTTCAAAAGGG^ 
TGTTAGACAAAGAAGATCCAATACAATATGCAATGATGTGGGGTTCTGGTC 

a :g ncvrn caaagttatgactcctgagattggtgtaccatttgtaactgaacaaactatgo rrii aagcactag 

TAAAj<AACAAGCGTTAGCTAAAGAATTTATTGATTGG 
CTTTGGATCTATTCCTGCAAATAAAGATGCCCTCAAAGATCTACCTGA^ 
00 AAACCACAAAATATTGACTGGGAAGCTGTrGGAAAGCATTTGGATGAATGGCT 
GTACAATAA 

4346.2 

ATGATTAAATTTGATAATATTCAAATTAAATATGGTGA 1 ' 1 ' 1'IXj 1'1 'G CAATTG AT AATCTG AATTT AG AT ATACATG 
55 AAGGGGAAirriTlACAniCl 1 GGGCCTTCAGG ATGTGGTAAATCAACTACTrrGAG AGCATTGGTAGO ITl'l CT 

AGATCCATCATCAGGAAGTATTGAAGTTAATGGAACAGATGTCACTXZATTTGGAAC 

TATTGTATTTCAATCTTATGCGCTATTTCCAACTATGAL 1 0 IT 11' 1 GATAATATTGCATTTGGTTTAAAACrrTAAGA 
AGGTAGCTCCAGATGTTATTAAAGCTAAAGTATC^GCAGTGGCAGCAAAAATTAAGATCTCTGATCAACAGTTAC 
AGC GTAAT GTATCAGAATTATCTGGGGGTCAACAACAAAGGGTAGCATrGGCrCGTGCTL' 1 OO 1 1 I 1 1 GAACCTAA 
60 AATTCTTTGTCTAGATGAACCATTGTCAAACCTTGACGCAAAATTACGTGTAGATT^ 
CTTCAAAAAGAGTTAGGTATTACTACTTTATATGTTACTCATGATCAAGA 

TTGCAGTCTTTAACAATCGATACATCGAACAGGTCGGTACACCAGTAGAGATTTATCATAATTCTCAAACTGA^ 
TGTAT GTGATTTTATTGGAGATATTAATO 1 [ 1 1'G ACCG ATG A AACAGTCCACG AAGTATT ATTG AAAAATA CAAGC 
O i'1'1' ILM' IAGAGG ATAAAAAAGGATACATTCG ATTAG AGAAAGTTCGATrCAATCGTGAAACTGAACAAGATTTTA 
65 TTCTAAAAGGGACAATTATTGATGTTGAGTTTTCTGGAGTTACAATTCACTATACAATAAAAGT^ 
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GATTCTTAATGTAACAAGTATTGATACTCAGGCTGCTATTAGATCTCTCGGAGAAAGTG^ 
CCATCAGACGTTCTGCAATTTTAA 

4346.3 

ATGCGTCATAAATTAAATTTAAAAGATTGGCTTA1 1 CO I IT AGGCTTAATC 'IliU 1 1' 1 ' 1' AGT AACATTTATT ATTTA 
TCCAAACTTTGATCTAGTAGTGAATGTAl 1 1 0 i AAAAGGAGGAGAATTTTCCCTTGATGCTGTACATCG IGTTCTA 
AAATCTCAGAGGGCACTTCAGAGTATTATG AACA O 1 IT T AAGTTAGCATTTTCACTCATTATT ACAGTTAATGTCG 
TAGGTA ITC'll'l iiiXi nUl'ATT TACAG ACTACTTTGATATTAAAGGTGCTAAAATTTTAAAA 
TCTTTAATn'ATGG AGGACJ IXKi'lT'I' I' AGCGACTOGATATAAA 1 1 10 11' I ATGGTCCTTATGG ATTGATT ACAAAATT 
TTT ACAAAATGTT ATCCC 1 ' 1 C 1 1 1 AG ACC CTAA C 1 G G 1 11 ATTGGGTATGGTGCACj IXJI'l ATTCATTATGACATTTT 
CAGGAACTGCTAATCATACA 1' IXJ I'l'l ' 1 'I'AAC AAATACAATTCGAAGCGTTGACTATCACACTATTG AGGCTGCTCG 
aaatatgggagcaaaaccatttaci g rrr 1 CCGAAAAGTAGTGTTACCAACCTTAATTCCAACTCTATTTGCACTT 
ACT ATT ATG Ci' l'l'l'l 1 Cl'l AGTGGTTT ATCTGC AGT AGCAGCACCCATG A 1 TO I ' r' GGTGGT AAAG AATTTC AAACT AT 
AAATXrC^ATGATTATTACATTrGCAGGGATGGGGAATTCTCGTGATTTAGCT 
ATTGCAACTACAATTTTGCTTACTATCATGAATAAGATAGAAAAAGGTGGAAATTATATTTCrATCT 
AAGCGCCTCTTAAAAAACAAAAAATTGCGTCTAAGCCTTGGAATATCATrGCT 
CACACTTTTrCATGCTrCCACTAATTTTT AT ACTTATT ATACTIC 
TAACATTATCAAACTTTACTTTAGAAAATTATCGCrrATTCTTTAGTA^ 

AGCTTTATTTATTCTATTATTGCTGCGACAACAGCAACAATTCTCGCA O MOr ATTTGCIXXi'lti fTGTC AGAAAACA 

TAAAT lTtJO 1 ' IT I GA I ' l ' l Cri 'ATTTGAATATGGTGCTCTACTTC L I 1 OU 1 ' I ACTACCAAGTACA CT I ' l ' l AGCAGTAA 

GTTTATTATTTACri 1 ' 1 AATCAGCCACAA IT I C !'l GTCTTG AATCAG A 1 11 1 OCT AGGT ACT IT IXjGT AATTCT ACTT 

ATTGCATATATAGTTGTAAAAATCCCA r n 1 LT 1 ATAGAATGGTACGTGCTATTTTATTTAGTGTTG A TG ATG AG AT 

GGAAGATGCAGCAAGAAGTATGGGTGCTTCACCrrrrrATACTATGATGAAG<nTATCATTCCATTT 

G HO 1 1 1 CTCTCrGTTATTCCTITAAACnTAACTCTTrATTAACT 

CTAGCTCAACCATTAGGTArrACGATrCGATCTGCAGGTGATGAAACAGCAACATCTAA 

1 101 1 1 AT ACAATTGT' I'CI GATGATTATTTCTGGAACGGTATTATACTTCACACAAAGACCGGGCCGT AAACT 

GAAATAA 
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Tabic 2 



MEEIVTLDCLFTDRTKIEANANKYSFVWKKTTEKFSAKLQEQIQVYF 
HYWKEDSYTHPDGWYYRFHHTKYQKTQTDF^ 
5 AQRKIDVEPVrcQDCASLGYKRC>0JlGKRQVRJDMGLVUitANNLLKYSKMK2 

MGKGHWNRKRVYSIIUCFAVCACSVMICTCAVLLGCNIAGESVVYADETUTHTAEKPKEK 

EKTEQSEPSSTEAiASEKKEDEAVTPKEEKVSAKPEE^^ 

WYFK1~NANSKEA1KPDADVSWKKLDLPYDWSIFND^ 
10 DGVYWDSQVYVNGQLVGHYPNGYNQFSYDrTKYLQKDGRENVlAVHAVNKQPSSRWYSG^ 

EKNGTmTPKLEECXJHGKVErHVTSKIVNro 

PKLWTVLNDKPALYEUTRVYRDGQLVDAXKDLFGYRYYHW^ 

YRRLXQMKEMGVNSIRTTHNPASEQTLQUAELGLLVQEEAFDTWYGC^^ 

UrraVERGKNNPAlFKWSIGNEIGEANGDAHSlj\TVKRL^^ 
15 VGFNYSEDNYKALRAKHPKWUYGSCTSSATRTRGSYYRPERELKHSNGPERNYEQSDYGNDRVGWGKTATASWTFD 

RDNAGYAGQFTWTGTDYTGEPTPWHNQNQTPVKSSYFGIVDTAGIPKHDFVXYQSQWVSVKKK^ 

ElJVSKVADSEGKIPVRAYSNASSVEUO^GKSLGLKTFN^^ 

GKEMRDKrrTAGKPAAVRUKEDHAL^DGKDlTO 

YKAQADGSWIRjCAFNGKGVAMCSTEQAGKFTLTAHSDLUCS^^ 
20 TVPFVYSDGSRAERPVTWSSVDVSKPGrVTVKGMADGREVEARVEVIAUCSEl^ 

VEEYEVDKWEL\EEDKAKLAIPGSRIQATGYLEGQPIHATLVVEEGNPAAPAV!TVTVG 

YGAKLPEVTASAKNAAVTVUjASAANGMRASinQPKDGGPL(^YAIQFL£EAP(CIAHLSLQVEKAK 

RAHYQDCTQANa.PADKVTFSTSGEGEVAIRKGMLELHKPGAVTLNAEYEGAiOX5VELTIQA 

DLHQEPSLPATVTVEYDKGFPKTHKVTWQAIPKEKLDSYQTFEVLGKVEGlDL£ARAKVSVEGn^SVEEV 
25 QU>ESVRTYDSNGHVSSAKVAWDA1RPEQYAKEGVFTVNGRLEGT^^ 

ASDSNPSDPVSNVNDKUSYNNQPANRWTNWNRTNPEASVGVL^ 

YVGKTVPTAPKNPSFVGNEDHVFNDSANWKPVTNLKAPAQLKAGEMWFSF 

QIFAKQVAAAKQCK^QVDGKDLj\NFNPDLTOYYLESVDGKVPAVTASVSNNGLATVVPSVREGEPV^ 
ILGEYRLHFTKDKSLLSHKPV AA VKQARLLQ VGQALELPTKVPV YFTGKDG YETKDLTVEWEEVPAENLTKAGQFTVR 

30 GRVLGSNLVAETrVRVTDKUjCTXSDNPNYDENSNQAFASATNDIDKN^ 
EVSAGVIFRENGKIVERTVTQGKVQFFADSGTDAPSKIVLERYVGPEF^ 
ADKDIAAGDEINVTFIUIKAKAMRWRMERKADKSGVAMIEMTFI^PSEL^ 
YKGQRPKVSVEENNQVASTVVDSGEDSFPVLVRLVSESGKQVKEYRIHLTKEKPVSEKTVAA 
FCTVEKKDSTLYLGETRVEQEGKVGKERIFTAINPDGSKEEKU^ 

35 SS EA5QTNKAQLPSTGSAASQAAVAAGLTLLGLSAGLVVTXGKKEOZ 

MKIMKKKYWTI^LFFCU^NSVTAQEIPKN 

TDICHLEKNCCKI£LEPQINNDIVNSESNNLLGEDNLDNKIKENVSHLD^ 
GESYKLYSKSNSKVSLULDSGVDLQrrTGLUCNU^ 
40 . INGVNPHVMNVYRIFGKSSASPDWIVKAIITMVDDGNDUNLS^ 
UVAALGNDSLNVSNQSDLLKLISSRIUCVRKPGLVVDVPSYFSS^ 
GUWFINAEKYKEDWffSATLGGYTYLYGNSFAAPKVSGAIAMI©KYKL^ 

MKKTWKVFLTLVTALVAVVLVACGQGTASKDNKEAEtXKVDFTL^ 
45 PEESS5DLVtNGKAPFAVYFK}DYMAKiaEKGAGrrAVAAlVEHNTSGllSRKSDNVSSPKDLVGKKYa 
KTLVES(^GDFEKVEKVPNNDSNSrTPL\NGVFOTAW 

YUCDNKEEARKVIQAKKGYQYAMEHPEEAAOIUKNAPELKEKRDFVIESQKYI^KEYASDKEKWGQFDAARWNAFY 
K WDKENG ILKEOLTDKG FTNEFVK2 

50 MKRTWRNSFVThOJfn>FMIGNIEIPNRTVU^ 

GENPVSlQLrcSDEDSLARAAETCQEhnTCmVDINMGCPVNKlVKN^ 

TCWAOPSLAVENALAAEAAGVSALAMHGRTREQMYTGHAOtETLYKVAQALTKIPFlANGDIRTV 
DAVMIGRAAMGNPYIJhKJlNHYFETGEILPDLTFEDKMKUYEHUCRUNLK^ 
^ RGAISQ A5TLAEIETLLQ LEKAZ 

MKNPiaLTKSFUlSFAlLGGVGLVfHIArYLTFPFVTTQLEGEKFNK 
RDIVDKRLPLVHDLODCZXjKI^WIVMLDMSVSTADGKQVTVQFVHGVDVYKEAK^ 
YTKRU^PLFYtSEVTSKMQDU)DNIRFDESRKDEVGEVGK^ 
HEUCTPLASLRIIL£NMQHNIGDYKDHPKYIAKSINK1DQMSH^ 
60 SIGVTIENQLTDATRWMSLRAU5KVLTNUSNAIKYSDK^ 
K0ESSGLGLYIVNNILESYQMDYSFLPYEHGMEFKI5LZ 

MYIXDLMEKAECGQFSIl^FLLQESCrTTVKAVMEETGFSKATLTKYVTLLND 

GRDIRSLFI^AVKYQILVYU.YHQQFLAHQLAQELVISEATLGRHLAGLNQILSEFDLSIQNGRWRGPEHQ1HYFYFCL 
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FRKVWSSQEWECHMQKPERKQEIANLEEICCASLSACQKLDLVLWAHlSQQWJlVNACQR5VmEKMRGYFDNIF^ 

LUtKVPSFFAGQHffLGVEDGEMMIFFSFlXSHWLPLHT^ 

SQLCAQVYLYKGmQDRYKYQLENRHPYlXMEHDFKETAEElFH^ 

HMRIGLDLTSGFLVFSRMAAlLKRYL£YNRFnTEAYDPSRHYDU-VTNN^ 

TZ 

MEFSKKTREl^KKMQERTU)UJIGGGrrGAGVALQAAA5GLETGUEM 

VSDTVSERAWQQIAPHIPKSDPMIXPVYDEIXjATFS^ 

EGLVGGGVYU>FRNNDAJU.VIEMKRANQDGALIANHVKA£GFLFDE 

SDKVRNUNKGTQFSQMRmGVHLVVDSSKIKVSQPVYFDTG^ 

QEDVDYlXGrVNNRFPESNTTTDDUISSWAGLRPLLAGNSASDYN 

SKLESSTSEKHU5PSAVSRGSSLDRDDNGU-TIj^ 

PAhfVTOSEIEAFAQUjVSRGLDSKEAHYLANLYGSNAPKVFALAHSL 

TNHMI^MRDSLOSIVEPtLi>EMGRfYDWTEEEKATYRADVEAAijVNNDLA£LXNZ 

MMNEI^EFLGTUIJLLGNGWAGVVIJ>KTKSNSSGWW 
WASVUMTIlJVQFAGAMLGQlLVWUJFKPHYEAEENAGNI^^ 

GICTFAVGTUVGIGULGGnGYALNPARDLGPRlMHSILPIPNKGDGDWSYAWIPVVGPVlGAALAVLVK 

MTKKKIERISVIHREKILWUCWYFMRDKEQPKYSVLERKMFDAAKNQDKO^ 
VYVYNHMNVIGACQRILFISQSPAYDKLNKWFNIYSDLYFSVVPLPKMGVYHEMVGIZ 

MKNSNEAEMKIXYTDIRTSLTEILTREAEELVAAGKRV^ 

NDU»A10TU5DlGLGLAFYKClj\EUDPia>LRVYGAiKQDPQLiQQUELYHEMTKSQMS 

EKVTAYLNQGQL^QESQLSHLIEAIENDKVSSDFNQIALVIDGFTRFSAE 

EGNLYQASVKFUiHI^KYQTPAQDCSQTHEKMDSFDKAS^^ 

VARSDlQKUfENSDLSYKHFRIlXGDVASYQLSLKTIFDQYQlPFYLGRSEAM 

LRTDLYTDL^SDIDAFEQYlRYUjlNGLPAFQQTFTKSHHGKFNLERLNVt^^ 

FLKEGAVTKQLQDLTTTLEAVEQERQAEVWKAFCHVLEQFATVFAGSQ^^ 

VQSYDUAPLTADf^YAIGLTQDNLPKISQNTSlXTDEERQN^ 

LSAPSLFNESESKESAYl£EUHFGFRiU*£KRMNHKGLSKEDMGSY 

GKKL£(XXjL£NPAIFTSPS5KTLAKDTU3ALYPAKQEFYLSTS^ 

HRJFERALQU>NEDSFDQRLEQAIQETSQEREFEArV'QESLEAQFTKEV^ 

AFIQLDNGRSVFVRGKVDRIDRIJCANGAICVVDYKSSLTQFQFPHFFNC 

EPVQSDrfAVKSLAGAVVEASKSMKYQGLFLEKESSYLGEFYNKNKA^ 

AINPYTENGRSLAPYVQQHQAITGFEANYHLGQARFl^KLDLADGKRLVGEKUCQAWLEKIREELXRZ 



MKLIPFl^EEEIQKLQEAEANSSKEQKICrAEQIEAlYTSAQNILVSASAGSGKTFVMAERILIXJLARGV 

AATEUCERLEKKISKXIQETDDVDLKQHLGRQLADLPNAAIG™ 

VFHEVFEAHYQGKQKFTTSHLUCNFAGRGKDERGLRWVYKIYDFU^ 

QOCQALWDLESFFRYHI^NDAKEFAKAAYLENVQULDEIGS^ 

DLX?l^OAYNEERJCTQFAKLGQLSDQlAllJ)YQERYHGDTWKLAKTFQSFMSDFVEAYR 

ILENFPQVRESYQERFHEVMVOEYQDTNH1QERMLELLSNGHNRFMVGDKQSIYRFRQADPQIFNEKFQRYAQNPQEG 

RIJUJCENFRSSSEVLSATNDVFERIMDQEVGEINYDNKHQLVFAhfrKLTPNPONKAAFU-YDKDO 

TGEMRLVKEILKLHQEKGVAFKEIALLTSSRSRNDQILLAI^ 

YALVAU^PMFGFDEDELARl^LQKAEDKVHENLYEKLVNAQKMASSQ^^ 

HSLYDUWKIYMDRFYYDYVGALPNGPARQANLYAUaRADQFEKSN^ 

AVELMTTHKSKGLEFPYVFILNMDQDFNKQDSMSEVILSRQNGLGVKYIAKM 

LQl^YSEQMRLLYVAMTRAEKKLYLVGKGSREKI^KEYPAAK^^ 

SYIUnGEDQLTREAlGELETKSPLQDSSQADNRQSDTIKEALEM 

EGVElAGQGQSVGKKISFDUWSTKEKVTGAEra 

fdt^lxjqeilantohlyreqpfsmucrixjksqedfvvrgildg 
eealsraysieniekylillgkdevqvvkvz 

meuuwaesl^vdautippiyfri^eysvakywndlssaapb^ 
mpvqdiqttvslggedhivfngpdeqflggrlmgaragiggt^^ 
tsahgnm yg vqcevucineglnigs vrs pltpvteeorp vv eaaaauretkerflz 



MYKTKCLR£KLVLFUaFI?aiYQFANYSASFVOT^ 

RGKKEEVASDFYQFTYt^LGl^VVLLGMVLFl^PIILNHIGLEAAVAAVAVRYLWFUIGUPLLLFSVIRSLLOSLGLTKL 

SMYUMLUiJ>LNSGFOTLLIYGAFGVPELGGAGAGLGTSLAYWVLLGB^ 

RLGLPIGGTVFAEVAIFSVVGUMAKFSPUIASHQSAMNFSSLMY 
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TAUFAAFTXTFLYIITIGNVASLYGNDPKFIDLTVRFLTY^ 
VYAC 

MSTIAKIEALU^AGEDGIRVRQLA£LLSU > PTGIQQSL^ 
5 PJNQSL^RAALETI^ILAYKQPrTRIEIDAiRGVNSSGALAKLQAFDLIK^ 
PVIDELEIQAQESQLFGERIEEDENQZ 

MOTMISRFFRHU^UCSUOWGWMTVAA^ 
KEG<yiVTNNDYHKVYI>SLXNMSTVKSVTFSSKEEQYEKLTBI^ 
10 IOUEGVSEVQDGGANTERIJ : Kl^FmvWGLGlAALLl^ 

U^AlAPSVLVFIVYQIVYQSVNKSLVGQNUMISPDU^PUMIALLFVIGVnGSI^GlSM 

MIOCVRFnTjUXFFl^PEGAMASDGTWQGKQYUCEIXi 
YLKSGGYMAKSEWVEDKGAFYYUXJDGKMKJINAWVGTSYVGATGA 
1 5 LQKGKD YYFKSGG YU.TSQ WINQA YVNASGAKVQQG WLFDKQ YQSWFYDCENCN YADKEWtFENGHYYYUCSGGY 

MAA^WtWDKESWFYUCFTCKMAEKEWVTO^ 
SHSQAWYYTKSGGYMTANEVmfDKESWF^ 
WU}GKTTNENAAYYQVVPVTANVYDSDGEXU 

FIPYY^IX3HRFYHYVAQNAS1PVASHI^DMEVGKKYYSA0GLHHX3FK^ 
20 NNSU^NKGATFKEAEEHYHINALYU^VHSALESNW^ 

IC£NTYlDRGinTl^NKASGA4NVEYASDPYWGEKlASVMMKINEKLGGKDZ 

MKKVLQKYWAWAFWIPUXQAIFFYVPMFQ^^ 
WGElALGIFURVLNSmGQTFFRAVVFFFPAVLSGLTVAU^^ 
25 WXW(^VAMPlIIFI^GLQSlPTEmAARnX3ATSKQ^ 

ATTSLGLLVYNYAFKNNQFGYANAIAVILFFUWISI1QLRVSKKFEIZ 

MMKQDERKAUGKYTIIJLGSVLILVPUATU^FKPTKDIVDNFFGFPTNFTW 

LAVMIFTPMAAYSlARNMSKRKAfTIMYTUJUjIFVPFQVlMlPrrVMMS - 
30 SIPESLDEAAEITCANQFTTYFRlIFPMMKPMHATTMnNALWF 
YGPSFASYVVGOSrnVYLFFQRHHSGMSNGAVKZ 

MKSaQKMGEHPMLLXFmSTVlS^JVQ^WMGLVASVGMFLF^FFU^fQS 
FQrVTCKFNYAFI^PNMQVWHQNRAEVTFFNPNYYGIICCFCIM^ 
35 AFPAIlAGAirYLFTmNWKAFWUIGVFAiGUFU^DLGVRMGTU>SSMEERJS!WD 
NSYPRIHAPYHEHAHSLYIDTII^YGIVGTTIXVLSSVAPVRI^MMDM^ 
GFIFLLVMCSIPLEHRMLVSDMTDZ 

MSKMDVQKIIAPMMKFVNMRGOAUCDGMIjULPLTVVGSLFLIMGQU* 
40 MGLlSCFSLAYSYAKNSGVEA^AGVt^VSAFFIIXRSSYlPKQGEAIGDAiSKVWFG 
nCMPEQVPQAlAKQFEAMU*FVIFl^MIVYILAKSLTNG^ 
GQSVVNGVVTALLLSNUDANKAMLASANLSIiW^ 
AFPAIFNVNEPWFGFPTVMNPVMFVPRLVPVLAAVTVYGAU^ 
LAMSTLVYFPFFKVQDRLAYQNEKQSZ 

45 

MKKKDLVDQLVSEIETGKVRTLGrYGHGASGKSTFAQELYQAU>STTVNLLETDPYI^ 
VAHELESLQRDILACRRVWMSZ 

MKKRYLVLTAIiAUU^CSQECTKNEDGETTCTEQTAi^ 
50 KHYPtfKDYNPGENPTAKAELVKUKAMQEAGFPlSDHYSG 

GLAFDVIGTTCDLVTEEKAAQWlXOHAAOYGFWRYlJCGKEKETGYNiAEEW 
EGGDYVDZ 

MRETOFLNHFUCKGYFKXHAKAVL^ 
55 LPrYlSNFSGEFSEARARNFRYDFFQEVMKKTGATALVTAHHADCK^VETlFMRL 

HFQKKDFPSIFHFE0TSNQENHYTTINR1RNSYU > ELEKENPRFRDAILGIGNEIU3YI)LAIAE 
TQRVU^TYl^W*DLNLTKAQFAEVQQ!UC^ 
QGYl^R3U^EGEU<WlPVSRErSmi!WRJCTGDVUi^ 
SKKTKNDIMNTVLYIEK1DRZ 

60 

MRKFUIUXPSFLTISKVVSTEKEWYTSKETYYLSQSDFGIYFR 
EWRLNKQGlPVFKI^NHQFlAADKRFLYDQSEVTPmK^ 

FLHIDQAGWVAKESTSEEDNRMSKVQEMLSEKYQKDSFSrYVKQLTTGKEAGINQDEKMYAASVL 
EGLYQLDTTVKY^SAVNDFPGSYXPECSGSLPKKEDNKEYSUCDLnXVSKESDW^ 
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IMGDDWDPKEKL1SSKMAGKFMEAIYNQNGFVLESLTKTO 
SIFTKNSDYDTISKIAKDVYEVUCZ 

MKKQNNGUKNPFLWLLnFFLVTGFQYFYSGNNSGGSQQINYTELV^ 
5 EETGIQFFTPSVTKVEKFTSTIU > AiyrTVSELQKI^TD 

NGRNPMSFGRSKAKAANKEDUCVR5SDVAGAEEEKQELVEVVEFUCDPKJUTKU3AWPAGVLUG 
VAGEAGVPFFSISGSDFVEMFVGVGASRVRSU^AKKAAPAIIFIDEIDAVGRQ 
FEGNEGUVtAATITODVLDPAUJUKjRroRKVLVG^ 
VLNEAALVAARRNKSUDASDIDEAEDRVttGPSICKDKTVSQKERELV^^ 
10 GGYMULPKEDQMLL£KEDMKEQLAGl^GGRVAEEIIFNVQTTGA5NDFEQ 
NHAMI^AQSPQKSISEQTAYETOEEVRSUJ^EARNKAAEIIQSNRET 
EES HALS YDEVTCSKMNDEKZ 

MKRSSLLVRMVISIFLVFULLALVGTFYYQSSSSAI^^^ 
1 5 SQDKVEGIRDLFLTXUC5DKDUCTVVLVTKSGQVISTDDSVQMKTC 

VTSVTQELVDAKGANLGVLRLDlSYETLEAYLNQLQLGC^GFAFnNENHEFVYHPQHTVYSSSSKMEAM 

YTPGHK5YVSQEK1AGTDWTVLGVSSLEKIJX?VRSQLLWTIXGASVTSI^ 

GAQNUUKEVGAYELREVTRQFNAMLDQUX^LMVA^ 

WQVHTCSUVTYFRijaNC^KDUCI^DEI^ 
20 KEKEGQGHIKUVQKQDSGLVIRIEDDGVGFQDAGDSSQSQLKRGGVGLQ 

EIYINRIETSZ 

MKRSSLLVRMVISIFLVFULLALVGTFYYQSSSSAlEATIEGNSQTTISCyrS 
SQDKVEGIRDLFLTtlJCSDKDUCTVVLVTKSC^^ 

25 VISVTQELVDAKGANLGVUUJ>ISYETULAYUVQLQLGQ(^ 

YTTGHKSYVSQEKIAGTOWTVLGVSSLEKLIXJVRSQLLWTIXGA^^^ 
GAQNLRAKEVGAYELREVTRQFNAMLIX510QLMVAUISQEETTRQYQ[^ALSSQ1NP^ 
WQVTKSLATYFRIj\l^(XJKDUCUDElNHVRQYLnQKQRYGDKLEYEINENVAFDNL^ 
^CEGQGHIKl^VQKQDSGLVTRIEDDGVGFQDAGDSSQSQLKRGGVGLQNVDQRLKLHFGANYHMiaM 

30 EIYINRIETSZ 

MFFKUJlEAUCVKQVRiKIIJTTFTVLVTOGTSrWTO 
VVQLLQMDIU>KFVEWGK(#EVGRRK1J^ATRYLAL^ 
VTWLGEQrTDKGYGNGVSMIffAGIVSSffEMIQGrYVDYFVNVPSS 
35 VAQGAPSSSYLPLXVNPAGVIPVIFASSITAAPAAlLQFl^ATGHDWAWVRVAQEMLATTSPTGlAMYALLin^ 
VQINPEKAAETYKRWPISMEFVLVKVQKNICLNFFWLQLLVPSSLV2 

MDIRQVrrETUMIEEQNFDlRTTITtfGlSLLlX^ 

AATDATDYVVLAKAU^KAAKEIGVDRGGFSALVQKGYQKGDEILINSIPRALAHTDKVCS^ 
40 MGRIIKETANl^DMGVAKLVVFANAVEDNPFMAGAFHGVGEAOVIINVGVSGPGVVKRAl^KVRGQ^ 
TAFKITRJGQLVGQMASERLGV^FGrVDL^LAPTPAVGDSVARVLEEMGL£TVGTHGTTAAL^ 
NQVGGLSGAFIPVSEDEGMIAAVQNG$UVL£KL£AMTAICSVGLDMIAIPEDTPAET^ 
PKGKEGDMIEFGGLLGTAPVMKVNGASSVDFlSRGGQlPAPtHSFKNZ 

45 MTQUDGKALAAKI^QLAEXTAKUCEETGLVPGLVVa^ 

DUAKYNQDPAWHGILVQLPtPKHIDEEAVLLAlDPEKDVDGFHPLNMGRLWSGHPVMIPSTPAGIMEMF 

KNAVVIGRSNtVGKPMAQLLLAJCNATmraSRTHNUKVAAKADlLVVAlGRAKFVTADFVK^ 

GKLCGDVDYEAVAJnjVSHnWPGGVGPKCTn^LMEQTYQAALRTUDRKZ 

50 NGKFNRIHLVVl^VGIGAAPDANNFVNAGVPDGASDUGH^ 
ATKl^EVSLGKDTMTGHWEIMGl^rTEPFDTFWNGFPEEILTKI^ 
YT3ADPV^lAAHEDIIPlJ)ELYRICEYARSrri^RPALLGRIURPWGEPGNFTRT 
DTYAVGKWDlFNGAGINHDMGHNKSNSHGIDTUJCTMGtJ^^ 

RLPEriAAMR£NDUXn"ADHGNDPTYAGTDHTR£YIPLLAYSPAFKGNGLlPVGHFADlSAW 
55 DKLVZ 

MFlSISAGrVTFU.TLVElPAFIQFYRKAQTO 
ILFILVLYGLVGFl^DFlXVFRKINEGLaNPKQKLALQLLGG 
AVNLTTCVDGI^ISVVISI^AYGVIAYVCXKJMDIIXVILAMIGGLLGFFIFNH^ 
60 LHQEWTUJIGIVYVFETTSVMMQVSYFKLTGGKRiFRMTPVHHHFELGGLSGTO 
LAILYLM2 

LRCKNKDILNIALPAMGENFLQMLMGMVDSYLVAHLGLUISGVSVAGM^ 
YHVTEALKrTIXLSFLLGFUlFAGiCEMlGLLCrrERDVA^^ 
65 SNALNU-FSSLAlFVlI)MGLAGVAWGTrVSRLVGLVILWSQUCLPYGKPTFGLOICElXTLAl^AAG 
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LVVSFGTEAVAGNAJGEVLTQFNYMPAFGVATATVM 

LTHLYTTDSLAVEASVLVTl^LLGTPMTTCTI^inAVWQGLGNAlU^FYATSIGMW 
AGSLLDNGFRWLFLRYRYQRYMSUCGZ 

M<TOEKHSQAAVLGLQHLLAMYSGSILVP!MUTALGYSAEQL^^ 
FQSVAPUMlGQSHGSGAMFGAUASGlYWLVSGnSKVANUTCIV^ 
AnVLIIlXINnTXGFTKSISIUGLWGTAIAATMGLVDFSPVAVAPLVHVPTP 
GVYI^l^DmCDPIDSTRLiWGYRAEGl^VLLCGl^ 

LAQnPSSVLGGAMLVMFGP/SIQGMQILARVDFANNEHNFUAAVSlAAGVGU^NSW-FVSMPTAFQM 
LAIVLNAVLNHKKKZ 



MKDRIKEYI^DKCKVTVNDI^QAUjKDSSKDFREU^^ 

FVSLEGEEDDLiWGKNDVNYAtDGDTVEVVlKKVADRNK^ 

QKISQPIYVKXPALKLEGTEVUCVHDKYPSKKHDFFVASVLDVV^ 

DAPSQKDMEGRLDLRDEnTTlIXjADAKDLDDAVHnCAIJ^^ 

RWPML?ERUNGICSLNPQVDRLTQSA£MEIDKHGRVVNYT^^ 

LMAKUIETLEMtfRVKRGALNFDTNEAKILVDK^^ 

EPKAEKVQKiTOYASSFGLRIYGTASElSQEALQDlMRAVEGEPYADVUMM 

HTOPIRRYPDULVHRMiRDYGRSICEIAEHFEQVIPEUTQSSN 

KFGLPVELPhHVEGUHITNlJEFYHFNERDLTLRGEKSGrTFRVG^ 

SRSGRGRDSNRRSDKKEDKRKSGRSNDKRKHSQKDKKKKGKKPFYKE 

MGTTGFniDLIILIVYU^VLVAGrYFSKKEMKGKEFFKGDGSVPWYVTSVSIFAT^ 

QLGMVVAIPLTTRFILPIFARIDJDTAYDYLDKRFNSKAIJUISAL^^ 

AlVYSYTGGUCSVL WTDfl QGVIUSGWLALFVUANreCGFGAV^ 

TlI^YASSQDLVQRFTTTQNlKKl^OCMl^NGVl^l^TATVFYUGTC 

PVGrrcULAAIYAASQSTISTGLNSVATSWTLDIQDVISKNMSDhfRinX^ 

NSFMGLNT-GLLGGVFILGFVSKKANKQGAYAAtlVSTIVMVFIKYFl^ 

APKYTTTHDITEIKADSSWEVRHZ 

MKFSKKYtAAGSAVrVSl^LCAYALNQHRSQENKDNNRVSYVIXjSQ^ 

TSHGDHYHYYNGKVPYDAl^SEEUJrfKDPNYQLKDADrVNEVK^ 

QEHVKDNEKVNSNVAVARS(y}RYTTNDGYVFNPADIJEDTGNAYWPHTC 

QPSQLSYSSTASDNhn"QSVAKGSTSKPANKSENLQSUJCELYDSPSAQRYSESDGLVFDPAKIlSRTPNGVAIPHGDHYHF 

IPYSKLSALEEKlARMVPlSCrTGSTVSTNAKPNEVVSS^ 

HFHYIPKSNQIGQPTUWNSLATPSPSLPlNPGTSHEKHEEDGYGFDANRnAEDK 

VRKNIZ 

MKXRAIVAVrVIXUGLIX^LVKSYIVQQIPLGEV 
HMEDSFWMVIGLTUIAGGLGNFTORVSQGFVVDMFHU>HNF^^^ 

MhH'Nlj^FrVGUIDENDRFYFVQKDGQTYAlJVKEEGQHTVGDrrW 
EVRKDLGVFVOTGLPDKEIVVSIJ5ILPELKELWPKKGDQLYIRLEV^ 
WPAJVYRIJCI^GTFVYU , ENNMU3FIHPSERYAEPRUjQVU)ARVIGFR£VDR^ 
SNGGFMTU^KSSPDDIKATFGISKGQFKKALGGUMKAGKIKQDQFGTEUZ 

MKDVSLFLUCKVFKSRUfWTVLAU^ 

AKNNU)VQKNUJTUCTEILTUJ^^ 

TLEFPTHGIDQIVWILEVnPSLFWAIIFMLTQLFAERY 

GSUSG FGQLDYPYPlYSLVNQEVTIGIGQOVLFPGtXLAFLAflVIVEWYLIAYFFKQKMPVUT^ 
U}RIAHUPFTYLRSVEU^GRU>KQIDNVD^WSMGMVU^^ 

MMKFHJJIVSTPAILVAUAII^LVLQK^ 

VAVALTTYGSATAMIMFAGMVFNILURiTRFKYIFLTCHHTL^^ 

VQKYMVQLTGNDKVALGHFSSUIYWUGFrGSUGDK^ 

EISSGTSGLVYAUJl^GQFAAGV^OAGVRLI^ 

MVIMIASGTVVILPGVVPHFFCGATAGVIGNASGGVRGATIGAFLC^I^ 

LGMLNQFGSQAGIVIGLVULAVMFGVSFIKKPSATEE2 

MIKTFl^Al^VIU^IPirrYSn^NUNIWLSTQPII^QIY 

LLFGTDKTl^ASNKTKTUaVTWNVANQl^ 

IFrepPTNSGlAPVTVIVKKSYGFYTEAiCTFHTTRFGTIVLHSRKQ 

KAIIAGDFNATMRHGALAKISSHRDALNAU>PFERGTWNSQSPKU : N^ 

ETTFZ 



SUBSTITUTE SHEET (RULE 26) 



WO 00/06737 



77 



PCT/GB99/02451 



MNPIQRSWAY^SRKRIJlSFTLFLILLVLI^GtS^ 

EWSPELETVAiQjCDICEAVTGEQSVERDDLSAADNNLVSLTA 

AKKNGL£UtDKIGLDAG<£ESGKG<7rVEFEHGIFSGKX^ 

WKEMDGDi4KQVENLAl^(#YQVEKEh^^ 

L^GKGKSSlFl^FCLEVn/LVSLGAU^AFVAGNAJTTYLLQTlXASG 

CLSVALCFLFLFRKSPKEILSSISZ 

MUWAFAYVITlKFFKSrVIFUIlUvtAS^ 
TENKAIESYVKRINAIGDLTGYDUETPEIXKNLTADRAK^ 
OXHKDLAAKHGWKVGDKVKl^KTnjADNEKGAKETVEVm 
TEOTAIYGDATFFVTADKhOJJDVMKELNGlSGINW^ 

LSLWTNARRKEVGIlXSIGLKQASIUXJFrrESIUAlPALVSAYFLANYTAKAIG^ 
GAEVDGFSKT^^LDISIQTSDFmFVIJ^LVLVyLVMAL^SSNURKQPKEUXDGE2 

MSQDKQMKAVSPUU3RVIWSSIVGGVGSUFCIWAYQAG^ 

AGVHYGHIIGTIYNYIGrVIGCAUFYLVRLYGAAFVQSVVSKRTYDKYlDWI^KGNRFDRJTIFMMIWPlSPADFtCM^ 
ALTKMSFKRYMTUILTKPFTLVVYTYGLTYnDFFWQMLZ 

MRNMWVVTKETYLRKVESWSFFFMV1SPFLFLGISV 

ASAKEAKEEKLKGYLTnXJEDSVUCAVYHGETSl^GIKF 

AXEWaCHQTlAAGALGFn-YMUJTYAGVTAQEVASEKCnrKIME 

AAVLU*DLPFIj\QSGIU)HLGDAISLNTILFIUSLFMYW 

GDNLUJUGSYlPFISTFFMPFRTINDYAGGAEAWISLAnVlFAVVATGFIGRMYAS^ 

MTFnKUvlKAHTSVRRFKEQEIPQVDLNElLTAAQMASSWKNFQSY 

GDLNRAEKGARLHTDTFQPQGVEGLUSSVDAAij*GQN^ 

MALCVPNQKHDMKPRIJ>LENVVFEEEYQEQSrEA!QAYDR^ 

KLLZM LKUAIVGTNSKRSTNRQLLQ YMQ KHFTDKA EIEL VEIKAIP VFNK PAD KQ V PAEILEIAAKIEEADGVDGTPEYD 

HSlPAVUiSALAWUYGIYPU.MCPlMrrGASYGTl^RAQ 

DVIKiCl^AlFDDFRJFVKrrEKLRNAQEUJUCOAEDFDWENLZ 

M^QLNNGVEIPVLGFGTFKAKIXiEEAYRAVl^^ 
QQTTEQTRQA1£KSIEKLGU)YLDLYL1HW^ 

LETATWPAVNQVRLAPGVYQDQWAYCREKGUXEAWGPFCH^EL^^ 
LPKSVmRlQAm.DCFGIEUHEERETliCTlAVQSGAPRVDOVDFZ 

MRCKMLDPUIQLGPbURWYALOVTGUl^V^ 
EIFAlWNGGLAIYGGLJTCjALVt.YIFADRKUfn"WDFLDLAAJ^ 
MYIEGSYRQPTFLYESLWNUX3FAUUHUIKWKSUTO 
GLGIMrVIYQNRJCKAPYYTTEEENZ 

MGKLSSILLCrrVSGAALALFLTSDKGICQVCSQAQDFLDDLREDPEYAKEQVCEKLTEVKEQATDFV 
VDSU^QTKSYAFQATEASKNQLNNUCEQWQEKAEALODSEEIVIDrrEEZ 

MKTKUFWGSMLFlXSI^ttiTTYLAWIFYPME 

LHHFAVVKWJ^VQLVALmPSFYVFVNRlVKKDFULYRKSLLALVVLPVMIGLCGVUGFDQF 
DTWLFDPAJCDPVIMILPETFFLHAFLLFFALYENFFGYLYLKSRRK2 

MTYHFTEEYDUVIGAGHAGVEASLAASRMGCXVU^TINIEM 

YIQMKMUNTGKGPAVRAUUQADKELYSKEMRKTVENQE^TLRQTMIDEIL 

TGTAUU}EniGDLKYS$GPNKSLASn4LADMJCEI£LB^ 

VKIXJVPCWLTYTNGTSHEnQNNUtRAPMFTGVVKGVGPRYCPSIEDKIVRFADKE 

SLPEDVQRDLVHSKGL£NAEMMRTGYA1EYDMVU ) HQLRATI^TKK1SGUT , AGQT 

ia<^KPEUUCR$DGYlGVMIDDLVTKGTlEPYRU^ 

DNEMKRJJDSIKUCPVKETNAJCVEEMGFKPLTDAVTAKEFUUIPEVSYQDVV 

AMDQVAKMKRM£EKRIPANIDWDDIDSIATEARQKFKLIW 

MTKQVLLVDDEEHILKlXDYHLSKEGFSTQLVTNGRKALALAETEPFDFTLLDfM^ 

MVSAKSDEFDKVI^L£LGADDYLTKPFSPR£U-ARVKAVUIRTKGEQEGDM 

NKLLSLTPKEFESDKNPFFEVFKVSKVTAQZ 

MTTFKDGFLWGGAVAAHQLEGGWQEGGKGlSVADVfcrTAGRHGVAREnW 

AEMGFKCFRTSUWTRIH>KGDEl£PNE£GLQFYDNLFDECl^^ 

AEVVFKRYXDKVKYWMTFNEINNQANYQEDFAPFTNSGIVYEEGDNREAIMY 
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GCMIAMCPIYPVTCNPKDtLVtAMKAMQ 
YYMSFAIDSHRENNPYFDYI^TEDLVKNNYVKASEWEWQt^ 

N(VHDDYWEYLGAH1REMKKAWEDCV0LMGYTPWGCIDLVSAGTGEMRKRYCFIYVDKDD 
WYKEVISSNGESVEZ 



MDQQNGLFGFLENHVMGPMGKI^QFKWRAITAAGMAAVPFTIVGSMFLV 

MV ANY ATMGSl^L YFVl^l^YH-TICIYAEEEELNMNPLNGALLAI^AFVMTVPQ QFDGGMMKTVTSUCEG A VIADG 

WAMGhTVn/\RfKJTTGIFTAlIMAIVTVUYRM 

DlFKVUIPTOFVS^NSWlGLMnYliTQUWIV^ 

GATLGUXYUFASKSEQUCAIGRASVVPAUWNEPUFGm^ 

TPVGIGAFLGTADLRAVLVALVCAFAAFLVYLPFIRVYDQKLVKEEQGIZ 

MKKFYVSPIFPILVGUAreVlJTFUFVNNNU.T\'ULPU n /GGYV 

MPVGVMKI^U^GEVEWI^PYAEULTKEDGDFDLEAVQTUKASVGNPSTYAK^ 

EQAlTDELVTSRPVIGIVSVDffYDDL£DETSESDISQINSFVANFISEFSEKHMMFSRRVSMDRP 

FSVTOAFREESKQRQLPLTUMGFSYGDGNKDEIGKVALLNLNl^ 

RTRTRAMMTAISDKIHSVlX)VFVn/GHK^ 

L±SVKDAMGMVTNWliILVDHSICrALTLSKEFY^ 

KXNRl^RMQASVLNUGMMUJTKNFTSRVT^ 

AEAKDMKCYDTWISKAADAMLAMSGtEASFVt^K>T^ 

EAGEKLTEIVLNEMKEKEKEEZ 

MKEKNMWKELLNRAGWILVFUJVVU-YQVPLVVTSILTLKEVAU.QSGU^ 

RAKDl^RLGLSYXVIVGSNUXiSllXQLSNETTrA^QSQINDMVQNSS 

GFVVGTWFALLHQPSNLPSLLiYGGMSTVLSWTAYKTQ^ 

MKEKNMWKELUNRAGWn-VFU^VU.YQVPLVVTSrLTUCEVA 

RAKDLARLGLSYLVWGSNILGSIU.QLSNETTTANQSQIND^^ 

GFVVGTTVFALLHQPSNU^IXIYGGMSTVl^WTAYKTQR^ 

MOTQKIEAAVKMIIEAVGEDANREGLQETPARVARMYQEIRGUKyrAEE 

LPFYGRAHMYIPDGRVAGL^KLARTVEVYSKKPQIQERLNIEVADALMDYLGAKGAFVV 

ATLTTVARGLFETDKDUUXMYRIJtfGl^KD^ 

FPENHWP!LLFVGrrSVUXLWGCTATYMEAPDKLFLLVGEEElKU{LK^ 

GLPVRLYVllXGVGKYFHFCQKASKFFTETCLDWDYVISQESK^ 

QKVPCKIWQNLYLRSYUWGDLFALSLRLLLUU^QVREQAW 

DKGQKEKGI^EVVRGLTSFVlXVELVVGUTFQEia^ 

MRKSrVLAADNAYUPLETTKSVLYHNRDV^ 

SYFRi*ATEWESDRVLYU>SDIIVTCELATLFEIDlXGYSIGAVDDVYAYEG^ 

LELAAEQNQWHLGDQSUJflYFEDNWlj^LDKTYNYMVGlDlYH^ 

LWWVYIU)U>WSEUf^RSDLNYFERSNQSKKQVMLV^ 

\TTVYQNVLHSR1DWLU>DS^YU)INTGGEVFNVVT^ 

F.IKZ 

MTmSStAVKKGUTCFIXFIYVUJSRHLPFV^^ 

MFSRKRLGLTSTSIEIQDRRKMYLTUJAVIQSLAVSU^ 

VUXSSMVLNlPQDVLETFQTVHIPTGnVU^LLTLV 

YMYVMSFUVPAYU^GFIFPNHSGIjKAUKEFMVGKPLW^ 

YTCADTSRFINRLVUIFSVIGGLFNVIMAGGPMLFVLH>EKIX 

MSSLSDQELVAKTVEFRQRLSEGESLOOILVEAFAVVREADKR1LGMFPYDVQVMGAIVMHYGW 

ATMPVYL^AFSGEGVMVVTTOBYLSKRDAEEMGQVYRFLGLTIGVPFTE^ 

YLNDNL^NEEGKiTJU>FNYVUDEIDD[lXDSA<7rPUU^ 

AKSAENFI^tDNLYKEEHASFARHLVYAlIUHKLFTKDKD^^ 

IO^PETRAMASITY(^LFKMFNKISGKfTCrrGKVAEKEFlETYNMSVVRlPTN 

YHAKGNPLLVFVGSVEMSQLYSSUJTlEGLAHhfVLNAN^ 

CKjUVTGTERMESQRIDLQIRGRSGRQGDPGMSiCFFVSLEDDVtKKFGPSWVHKKYKDY 

VEJCAQHASDSAGRSARRQTUsYAESMNlQRDrVYKERNRUIXjS^^ 

ISFHVKEVPDYIDVTDKTAVISFNUCQVTDKEI^EK^ 

ASQKNPIVEYYQEAYAGFEAMKEQCHADMVRNLLMGLVEVTPKGEIVTHFP2 

MIGTFAAALVAVLANFWPIETTPNSArfrEUPPDGIGQVUNlXUaVDNPVNALLTANYIRIl^ 
QEUJCTlADVTSKIVEWUNLAPFGtLGLVFKTlSDKGV^ 
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NCXRVSGVTAFFTRSSATWVNMKLCHDLGLNPDTY 

VAAISSCDA5GlAGGSlXLIPVACSLFGISNDlAlQIVGVGFVIGVIQDSCETAi^SSTDVLFrAVAEYAATO 



MSISQRTTKUlj^Ta^CU>YFLNl^AVSAGIlAlXSLSDTRRy 
LAFYVPLAYKMGWEIGrTPSTVLVSHIXVQESTSPDLLVNEFLLFAlCT 
LQRFKYYt^RGDGRNRAQLVAEU5TUiCEAUU.VYU)HSDHL 
ESUl^QLFSKUGQLSQTNPASDU-DEIERYI^VFRNW 

MEIMSLAlAVFAVaGLVTGYVSISAKMKSSQEAAELMU^AEQ^ 

EEARKYREEVDAEFKSERQEUCQIESRLTERATSLDRKDDNLTSKE^ 

L£IUGAUQA£ARDOlAQTE£NLTR£lASRIR£A£QEVK£IUDICMAKD 

RflGREGRNIRTFESLTGVDVIIDDTTPEVVTLSGFDPIRREIAAMTO 

AAYEIGAPNUU^LMKIMGRLQFmYGQhrvaJ^ 

IGMELARKYKEPPVWNTnASHHGDVEAESV 

QAGREIRIMVNPGKIKDDKVTILAHKVRiCKIENNLDYPGNIKVT^ 

MKiUCPSIDTlXDKVPSKYSLVnXAiaUHElJ^ 
RKEEEEKKKEQIAKEKEDGEKIZ 

^«AYQU^WQDEASNQGAFTGUWmGARFEQ^UJKGEQAFQ^ 
AIMDGDQFGSDFVTCLNPNSKIPALLDQSGTEWRVFESAHILLYLAEKFGA 
GGHjHFFNYAPEIOEYPINRFTMEVKRQLDLLDKELAQKPYUGNDYTIA^ 
YQNL VKWAEK1ANRP A VKRG LEVTYTEIK2 

LASLJTSlIMFYVGFDVUU)TlQKJLSREErVIDPLGATLGIISAAlMFV 
GTALAlLASSFNYTrVT)KLVAnnTFIUCTAYDl 

I^MNPDLSVFESHEIADQVESMLEERFGVFDTDVHIEPAPIPEDEII^NVY 
DGEQMDKEAYKTKKELNSAlKDlQrTSiSQKTmCYEI^ 

SnTKLVATDMDGTFUX}NGRFDMDRIJCSU.VSYKEKGI^ 
YEATMSRDFYLATFEKUCTSPYVDIhOClXLTGKKGSYVLDTVDET^ 
TLEDGEAWVNENVTOVKAMTTGFESmiVUJYVDKGVAIVELVKKLG™ 
NARPEILELAKTVTGHHKERSVIAYMEGLZ 

MADIKLIALDLDGTIXTTDKRLTDRTKETI^AARDRGKVVLTTC 
NTGEILDKTVFSYDDVARLYEETEKUI^UMISEGTVYQIQSDQESLYAK^ 
QEPLDAAIQKISPELFDQYEIFKSREMU-EWSPKNVHKATGLAK^ 
QNAVPEVKAAANWTPMTNDEEAVAWAIEEYVUCENZ 

MESLULLLLANLAGU : UWQRQDRQEKHLSKSLEDQADKL^ 

QGLTQVRQEMTDNUIJTRDKTIXJRUMUJESNEQ 

TVARDVGAUreVLSGTKTRGILGELQLXjQUEDnvnTAQ 

DYYRLEEAYETGDKDEIERCRKSLl^VKRFARDIRNKYIAPPRTrNFGVli^ 

GPSTLSAIXNSl^VGFKTLNIQK&ADHISKTl^ 

GEPALOLLHFQENEEEYEDZ 

MKISHMKKDELFEGFTUKSADLRQTRAGKNYU^ 
PQVNQraRU^AGEPNDPADFKVKSPVDVKEIRDYM^ 
GUVYHTATMVRlJVDAlSEVYPQLMCSLLYAGIMUmijVKVT^ 
KEEVVLLRHVTLSHHGLLEYGSPVRPRIMEAEifflMID^ 

MSEKAKKGFKMPSSYTVUJIIAIMAVLTWnPAGAFIEGnfETQPQ 

VAfmMVGGFlX3IVNKTGAU>VGIASIVKKYKG^^ 

LTGVAmXGSQlGCLASTl^PFATGlASATAGVGTGD^ 

UCHIWVEESSSVESTI^KQKSVUTJ^THl^ 

AMU^FMGOJGVIYGUCEDiaiSSFMNGAADLU^ 

Yt^MSFUPSSSGLASATMGIMAPL^EFVWRPSUrTAYQSASGVLNUAPTC 

IIWTIALLLLGTFLPFLZ 

MSNSFVKLLVSQU^\NLADlFFR\miANIYIISKSVlAm^ 

VGMFTVMQSVAPLVTYI^VAISUJXjFAAPVSYAJVPRYATDLGKANSAI^MTGEAV^ 

CINLVLYlISSFU4U^PNAEVEVl^ETNl£nXKGWKLV 

WGYSOTAYStGIOSGmFRI^EKFLAAKWEPQLiTPNLKTC^ 

RKETHMNSLPNHHFQNKSFYQLSFDGGHLTQYGGUFFQEI^SQUCUCERISKYLVTNDQRRYCRYSDS 
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TGYGTDYACKH^DAYFPKU^GCXJLASQPTI^RJ^RTDEE^ 
GKQEGVAYNAHYRAHGYHPLYAreGKTGYCFNAQUTC^ 
CTGQ YYLIKUCKhnVLSRUjDULPCPQDED^^ 
GTSQDQI^LYRCRCQAENFIKEMKEGFrcDKTDSmiJCNEVRM^ 
5 HVVGKCVRTGRKQUJCl^YAYSELi^ALYSRIRKVMUNU>VPYEPPRRKASLMMHZ 

MMEFH^LPHLEPYGNPQYFVnrVIAATLP^ 
YKHYRJCSKDCKWVFYLVSFUIXPIIFVKVQPAINGTQSUGFI^^ 
" FSSGPIDRi^RiVEKYQAlPERDEU^DMLOESVRYIMWGFLYKHLAHVLGETlXPP 
1 0 F<3L£IJTDFAGYSMFALAlSNUdGIRSP!NFNKPFIJ>^ 

VTSSMAYTVNMUMGFWHGVTWYYIAYGLPHGIXLVTNOAWVRKXKT^ 
WMLSFUFSGFLNNLWFKKZ 

MLXRLWMIFGPVUAGIXVFUJFFYPTEMHHNLGAEKRSAV 
15 GAHPAVLAEKYNRSYRPYU.G(^GAASLNQYFGMQQMLPQ 
FUCHQSGDQASQYAATRIIi&FPNVAMKDLVQKl^KE^ 
AKYUCIUMXJFSYQAIEDVVKADAEKNTSNNEMGMENY^ 
SICVNPIFUPPVNlQCWMOTAGUtEDMYC^TVQKm^ 
PFLSNPTPAPTYHLNERFFSKDWATYDGDVKEFQZ 

20 

MEKNLKAUCQTTDQEGPAfEPEKAEDTKTVQ 

KMTQAEYKAYYTKGYHTDVrTONITD^MEFVQGGQSKKrrYKYVG^^ 
QFSDHWAPVKA£HraiFFGGTSQEAiJ ? EEMDNWPTYYPDNLSGQElAQEMlAHZ 

25 MKDGHLLAHHIRLLNGRIH)K1XSQDPEALYRGEQGKI1^ 

VIVSPCClCDKIUaCYLVLTELGICSQKEVGHRVSQlCU)TIFYKGFSEEEIHQre^ 

MTNLIATFQDRFSDV^TAJ^HLQL3LLTLI.l^ 
PALTALVH'AIFPn.QrmrcLKGIDPNLQEAGL^ 
30 GSFIlXGIDRNNASUUGALSSAVtjUAFNFLUCVMEICAKUnTO 

EIIANMYKIXIEENTSMTATVKPNFGKTSFLYEALKKGDIDIYPEFT^^ 
HLAYLKPMSYQrOYAVAVPfCKUQEYGLKTISDUaCVECQUAGFTUFND 
QAIQSGDIQrTDAYSTDAEI^YDLQVLEDDKQLFPPYQGAPLM^ 
VEGKSAKQVAKEFLQEQGLUCKZ 

35 

MMHTYLQKK1ENI1CTTLGEMSCGYRRMVAAMADLGFSGTMKAIWDOLFAHRSF 

VDWIGMICSLTGUCVIFVSEGRASNYI^UNSVIYLII^LQKGFYGEVLTTLYFTVMQPIGLLVWIYQAQFK^ 

ARKLDGKGWTKYUtSVLWWlj\FGF!YQSlGANRPYRDSrn>ATNGVGQILMTAVYREQWlFWAATNV^ 

LQIQGKYUYUNSLVGWYQWSKAAKQNTOUUNZ 

40 

MRNMKAKYAVWVAFFUO-TYAIVEFUGGVFGSSAVLADSVHDLGDAU^^ 
ALVTAVILVTGSVLVILENVTKILHPQPVNDEGILWI^IIA^ 
VLRFTDWYn.DPlX^LVISFFII^KALPRFWSTLKlFU>AVPEGLI)QCQVKSGLERU3 
^ KVCUCEMEHMETOCESIIUFUCDCGHJNm 

MIEYKNVALRYTEKDVUU)VNLQlEDGEFMVLVGPSCSGKTmUCM 

YVLQAlALFPNLTVAENlAUPEMKGWSKEEnXKTEELLAKVGLPVAEYGHRLPSEI^GGEQQRVG^ 

MDEPFSAU)AlSRKQU3VLTiaaJ{KErc\rraFVTHDTDEALK^ 

GSVHDZ 

50 

MSAVAISAMTKVMQETHGKPSSIHGHGRQAGK1XREARQELAQ 
inTAIEHHAVlCTIDYLVQHPGFEATUQPENQEn'AQQlQKALRDDTILVSTMP^NNETGN 
VDAVQAIGlOPIHSEELGIDFLTASAHKFHGPKGlGFLYASSMDra 
EKQEEHFQHVQNLCTARAELEGIQYYUNRGKHHLPYVUflGF 
55 AMYGANSCRXJCESUUSLSPQhn^EOLQTLAKTLKEIIGGZ 



MLFKLSKEKIELGLSRI^PAJUUFUFALVtLLGSLU^LPFVQVESSRATYFOHLFTAVS^ 
QnCUXlQIGGLGUtfTnGVFYIQSKQIOSLRSRAT^ 
60 GLJSSIFLAlSAFCNAGFDNUSSTSLPAFQTDLLVNLVWGUrrGGLGFMVWFDLAGHVGR 
GULFCrrATTUT^NNAGTIGW.PVADKVLVSFFQTVTMRTAGFSTTDYT^ 
riTFFVLLVFARSELI^U>HANVARRTUPRTVQKSFSVFIlFlJviSFLIGLILLGrrAK^ 
TPDLGKLALSVlMPLMFMGRIGPLTLFVSLADYHPEKKDMIHYMKADIStGZ 

65 
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MSDimGIUJLGircSSVLAALAKQDMNIUlDDHAEJUNQFEPVLARGVIGDITO 

VLAVMHOCSt^VPTVlAKVKSQTAKKVLEKIGADS^ 

QSLSKU3LRGKYNU*U^FREQENSPLDVEFGPDDIXKADT^ 

5 

MKU^IAISSYNAAAYU4YCVESLVlCK3EQVClLnNrc^ 

GRYFKVVDSDDWVDPRAYUCILETLQEUSKGQEVDVFVTNFVYEKEGf^Rl^ 
GQ YTMMHSUYRTDLLRASQFZ 

10 

mkiwnqrytowsiwu^vgv^ 

egvtg^nslptptertevseetspssldtlfekdeeaqicnpeltdvlxetvot 
tkdsid^aaylekaegkgpftagvnqvipye^agdgmltruxkasdnapwsdnct 
vdlng^gk<xx3alidqlrangtq^^ 
15 idvpaaylekakgegpftagvnhvipyelfagdgmltruxkasdkapwsdngdaknpauplgew 

AUX3^GKEKQAUiX3FRANGT(7roATVNVYGM^ 

PAAYLEKAKGEGPFTAGVNHVIPYEIJFAGDGMLTRIXLK^ 

GWAGKEKQALlIXJFRANGTCyroATVNVYGNK^ 

20 72 

MKIJCSYILVGYTlSTlXTILVVFWAVQKMLIAKGErYFliXjMTW 
SNUV(XiPVEFWlXQTFNEMSHDl^VSFDSLEESER£KGLMUQI^HDIKTPm 
R(n*EW-NKLVEEI^FLTLOTARNQVETTSK^ 
25 LVDNAFKYSAJKnTOEVVAKl^KIX3L5lSVTDEGQGIAPEDl^Nire 
EITVSSQYGLGSTFTLVLNLSGSENKAZ 

MR3(^AQHGLTNSLKDFWIFLLNIGPQIj\FFCQMLRCSRSVEQGTGNHRREFN^ 
30 LELLVNAUWHUJDRLVLTPNQmETORQrVHGU)^^ 
DLVPAKNLPNRQDZ 

MEHLATYFSTYGGAJTAALGrVLAVGLSGMGSAYGVGKACKJSAAAUJCEQPEKFASAUL^ 
35 LQLTPELPLEKGVAYFFVALPlArVGYFSAKHQGNVAVAGMQILAKRPlC^ 

MUCSEKQSRYQMLNEEI^n-LEGETNVl^NUNASAUTC 
GEAAHFQETVIVGD\nTYLNYISCDSLAKSEIVVPMMKiN^ 
40 FTMFEEKS2 

MSVLEIKDLHVE1EGKEIUCGVNLTUCTOEIAAIMGPNGTGKSTUAAIMGNPN 
MGLFl^MQYPSEIPGrTNAEFUlAAMNAGKEDDEKISVREFnXIJ3 
45 QLDrfLEFITALLDElDSGLDlDALKVVSKGVNAMRGEGrc 
LEREGYAKLAEELGYDYKEELZ 

MPYKRQRSFSMAi^KU>SLYMAVVADHSKNPHH(^ 
50 TASASMMTDAVI^KTKQEII^LATireEMVQGQ^ 

MKIQDLLRTOVMUJ)LQATEICrAVtDEMIKNLTDHGYVTOFETFKEGI^ 

LFAKSNKGVDYESUXjQATDLFFML^PEGANDTHl^ 
55 VQAPANDSGDFIVAVTACTTGIAHTYMAQEALQKVAAEMGVGIKVETNGASGVGNQLT 

DRFDGKPUNRPVADGIRJCTEELINUVUGDTEVYRAAW^ 

ALAFUTCALGVPNENLGNLGSYHEU^MFMKICftAAFGLMt^ 

YAAGGEATSTLAGVSSGFLGALVGGFUGALV^IKKYVKVPRS^ 

NDFUjGLGGGSAVLLGrVXGGMMAVDMGGPVNKAAYVFOT 
60 KDWTKEERNSCLTNHMGLSFnTGAlPFGAADPARAlPSnLGSA 

VLVGAIVSGWYGYLRKPQAZ 

MANKNTSTTRJUU>SKA£LERK£AIQRMLIS^^ 
65 QEGLLSGFFTIFAGLLUFEAYLVWKYGLDKSVUCGTMAQVVTDLTGFR^ 
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IULVGSU.VSPWSVYDIAEFFSRGFAKWWEGHERRKEERFVKQEEKARQKAEKEARLEQ 

ILTEEAVQNU»PlPEEKWVEPEnUK3AELKFPEQEDDSDDEDVQVDFSAKEAI^YKl^ 

NlKILEATFASFCKVTVERAEIGPSVTKYEVKPAVGVRVNRiSNUDD^ 

SFRELWEQSCyrKAENFlXlPLGKAVNGTAJUroLSKMPHIXVAGST^ 

MVEI^VYNDIPHIXIPVVTNPRXASKAU^KVVDEMENRYEIJJAXVGVRNIAGFNA 

LADl^MVASKEVEDAliRUX3KAilAAGIHMIlj\TQRPSVDVISG 

DMLFKPIDENHPVRLQGSFtSDDDVERIVNFUCTQADADYDESro 

SA5MIQRRLSVGFNRATRLMEELEIAGVIGPAEGTKPRKVLQQ2 



MSYFKKYKFDKSQFKLGMRTFKTGUVFLVUJFGFFGWKGU51GALTAVFS 

FFUJfnTHEAFWVTLVWPICTMLTlMTNVAMN^ 

YDIDRIRLFLEKKEKZ 



MNKSEHRHQLIRALrTCNKIHTQAELQALLAENDIQVT^ 

MEDAL VWMRPVQHQVUJCTU^jLAQSFGSUDTUFPDAIATLCGNDVCUICEDAOTAQKCFK 



MKSrKLNAI^YMGntVLNnFPU/rcTTYVARVLDRro 

FSSU^CUCTILTTAVYTI^YPUTn)NPIVKKVYLVMGIQLiAQIFSIEWVN 

^HDrVVVTLVMSLSTUhfYUSYFWTKRD 

YmQRJVTVlAGVVTGAIGVSVPRUYYLGKGDKEAYVSLV^GSRIi^FFIIPUFGLMV 

TSLFAFRTHUVLiynLGSQlLFTNGYEKRITVYTVFAGLLNLGI^S 

IHLGWFSYTVRYSLi^FVAIYFUNFVYPVDMVlNU>FUNTGL^^ 

SZ 



MELFMKrTOYErYKUCKSGLTN^niCVLEYGENV^ 
DDCYFWDLSEIYDAPVUJ r YXGNLI>U.KFPKVAVVGSR 

QNGGKmvIGTGLDVFYPKANKRtQDYlGNDHLVUEYGPGEQPLKFHFPARNRIlAGLCRGVTVAEAKMRSG 
RAMEEGRDVFAIPGSOX)GLSDGCHHUQEGAKLVTSGQDVLAEFEFZ 



MKQLTVEDAKQIELEILDYIDTLOCKHNINYW 
LETDKNYFNNFIKnDSTTKUDTRNTKTYESGIFIDffPIDRFDDPKVIDT 
AFWLLLRPVSPRYFANKIEKEIQKYSRENCKJYMAFIPSKFK^^ 
MTLPPEEKRFYSHEFHAYKLEDZ 



MKINHLTrTQMCDUU)LVSDLTNmQDGEK^ 

KKKTUiDYFFUJSIDLDYSILYRIAEEUffDSNRFAS^ 

LKGQIQKTRQTVTF1SHDEDFUETADTIVHUU,W^ 

MEKHRRVKQNVETAUUTKDSTAGRIXAIOCMKTVI^ 

KENI^IDDRVLVQKUJLTVRGQEKIGUGPNGVGKSTIXAKLQRI^ 

KEELQKIQSHLASLNFSYPEMQHQIRSL^GGQQGKIXLLDLVLR^ 

HDRRFLKEVCSflYRMTEHGUCLVNUDLZ 



N(KPICTFYNLI^QNU>UIX^KEQFERYFEIX^ 

GFPSU>NUOLYPEU5VTIIDSLNKRINFLQLl>Q 

IPYUCVCKSKIXAUCASNAPEELLEAKNAI^ 



MSIKUAVDIDGTLVNSQKETTPEVFSAIQDAKEAGVKVVIATGRPIAGVAKIXD^ 
HEUSE5LTYEDYli)MEFURKI£VHMHAnXOGIYTANRNlGKYTVHESTLVSMPtFYRTPEEM 
LDAAIEKIPAEinrERYSiraCSAPFYl^UOCNVDKGSArrHljVEKLGLTKDETMAIGDEEN 
PEUCKIAKYmCTNDESGVAHAIRTWVLZ 



MTWIILGVlALTVIFVrVSYNGLVKNRMQTKEAWSQIDVQLKRRNDLLPNLIET^ 

TSPAEAMKASDALTRQVSGO^VAESYPDLKASANFVKLQEELTTO 

GFKAADFLQTPEEEKSVPKVDFSGLGDZ 
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MU^IASNKWCTWtLLLVFFLLI^LVGY 
LYHWEDMALVAQIPMPRVFIIDDPALNAFATGSN^ 
Al^AnmSSMAGRMMWWGGAGRRRSDDDRlX}NGLEIiMLVVSLlj^ 
RNPQGMINALDKLDNSKPMSRHVDDASSALYINDPICKCGGFQK^YTHPPISERIERLKQ^ 

MKU^IQEIRKQSEGLNFEQTLDLVDDUlARNQEILOVKDIlJVV^ 

YPVTEVFMEGATNQLDQEVLDDDLVU > tENGELDLAESVSDNIU.MPKVLTAEEEAGQGRSGNDWQIMTEEEYQAQ 
KAVKKEENSPFAGLQGLFDGDEZ 

MKRQLALWFSGGQDSTTCU : WVMQHYETVEAVTFAYGQRHHL^ 

HISYIPDKLCVESKSLKLY^SYRNHGDFHENCI^IGKDLVNLUDPRYXEVWGKFTPRGGIStDPYYNYGKQGTKYEGL 
AEQRLFQHDLYPEKIDNR2 

MTETVEOKVSHSrTGLDILKGrVAAGAVISGTVATQTKVFTNESAVl£KTVEKTO 
SESASTSASESASTSASTSASTSASESASTSASTSISASSTVVG^ 

KRRKRSVDSlEQLL^SIKNAAVFSGhrnVNGAPAJNASUNLAKSETICVYTGEGVOSVYRVP 

20 VTYVNPKTNDLGNtSSMRPGYSlYNSGTSTCrim 

AQMNGFFAKKGYGLTSSWTVPrTGTDTSFTFTTYAARTDRlGINYFNGGGKVVESSTTSQ 

TSASASAST^ASASASTSASASASTSASVSASTSASASASTSASASASTCASESASTSASASASTS^ 

SASTSASASASTSASESASTSASASASTSASASASTSASGSASTSTSASASTSASASASTSASASASISAS^ 

S ASASTSA5ESASTSASAS ASTSASAS ASTSASAS ASTS ASASTSASESASTSASASASTSASASASTS ASASASTSASASASTS 

25 ASVSASTS AS AS ASTSASAS ASTS AS ES ASTS AS AS ASTSASAS ASTS AS ASASTSASES ASTS AS AS ATO 

STSASASASTSASGSASTSTSASASTSASASASTSASASASISASESASTSASESASTSTS^ 

SA5TSASASARQVRRPQPVHLNRHQPVRQPQQVLVHQLQHQRVHRLQHQPVPRLQRQPVRQLQQVPVLQSQHQQVLQ 
PQHRQVPRLQQAHQHLNQRRQAPQLQQVPVRQPQRRQVRQPQQNO.VHQLQHQRVHRLRRQPVHQSQQVPVRQLPHQ 

QVPRLQQAPVRRLQQVl^PQPQPQPVRQPQQVSQRLNRHQRVRPLCW 
30 QQVlAPQPQRQQVHRLQHQRVRPLQQVLAWJPQRQQVHRLQRQRVRl^HQRVRQPQQAHQLLNLHQPVRQPQHRQ 
APQLQQVPVRQPQRRQVRJU.QQVPVRQPQQVPVRQPQRRQVRRPQPVHU^RHQPVRQPQQVLVHQLQHQRVHRLQH 
QPVHQSQQVPVRQFRINKCLGFSKYZ 

35 MGVETWFYSSlCWI^GtGSVWKFPYNfTAANGGGGFLUFL 

FTGWIGAFALFILl^FYSVIGGWrLVYLGIEFCKLFQLGGTGDYAQLFreilSNPAULGAQAAnLLNIFTVSRGVQKGIERA 

SKVMMPlXFTVFVFnGRSI^LPNAMEGVLYFUCPDFSKLTSTGU-YALGQSFFAI^LGVTVMLTYAS 

ISIVAMNISISIMAGLAIFQARSPFNIQSECGPSUFIVLPQU^KMPFGT^^ 

NSKRAKWSVILGILTFVFGIPSAI^YGVMADVHTFGKTFFDAMDFLVSN^ 
40 ERAWKQGLFQVWLFLLRFFVSSFQSSSLWSSLPNLCNQKGLEZ 

MUGCWQUCDVILLAFLSIFFGGVFVGSGYVYNII^LL^ 
LAEVLYGSQFGLGAl±SGFVQGLGSEFGFlVTTCNRYESWLSLTANSIGriT.VSF^ 
45 YFFCTILVRAIVKLYHQFATGGKAZ 

MVKVATQTTOU^IULETSFlPSlALTI^VVAFaLm^ 
CHTUFVTVaGLVFVSSVSUCEIXLYl^QKGl^RSWSY 
50 TVraWRHLYLRAUAHGYDEHAQUCNSYRTFYIPKKTKUYtXFI^ 

MRKHQt^VHKLTII^MMULDVVLTPtFRIEGMAPM 

FGAU^GLPYKYGRKFHYSALGEn-GTGHGSIVSYPVMVLFTGSAAKLSWFlYTPRFFGATLIGTAlSFlAFRFLK 
55 KVQGYFFSERJDZ 

MQEFTNPFPIGSSSUHCrTNEISCEMLANGItj^LGCKPVMADDSREVLDFTK^ 
QSSLJMVVDAVGVTTSSIRKSLVKDLLDYRFTVUCGNMSBRSLVGL^ 
60 GM$FLVTGPKDLVVSKNQVAVLGNGCTEL£WrTGTGDLVGALTAVFI^QGKTG 
LEEFRYQVLNQLSLLRJUJENWLOTDCGEVYEZ 

MNHKlAlLSDVHGNATALEAVtADAKNQGASEYWLLGDlFl^PGANDLVALIJCD^ 
65 YGl^DPQEVQtXRMTQYLMERMDPATIVWLRSLPLLEKKEIDGLRFSlSHNLPDKNYGGOLLVENDTEKFDQLLDAET 
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DVAVYGHVHKQLLRYGSQGQQnNPGSIGMPYFWEAUCNHRSQYAVIEVEDGEL^^ 
LPFtEMYEELRRDDKYQGHNLELLA5Ll£KHGYVEDVKNFFDFL2 

5 MhfVNQIVRIICTUCANNRKLNETFYIETLGMK^ 

NPUIEGII^KTDSIHRLYKGQNGYAFEIFSPEDOUUHAEDDIA^ 
LESSEIGASLDRPAQGQDLTVDNT\rTWDl^MLm 

10 MKUTTKITKKIEEQIEAGrYPGASFAYFKDNQWTEFYLGQSDPEHG 

IDRLVIDFU > ESDYPOmRQU.THATDLDPnPNRDUTAPELKEAMFHl^RRSQPAFLY^ 

VlUCJX}VWKPWG\fTETKFGPVELAVPTVRGVEAGrVHD^ 

NFSPU)DKERSLAWNLEGDWlX»HTGYTGTnMWNRQKQEATIFLSNRTYEK^ 

15 

MMKKTYNHILVWGVIFYSICWCFCITPQEQmGVGTPGIQHLGRLVFLLTPFNSLWKLGEVSDIGQLCWIFLQNILW 

FLFFPLIFQLLYLFPNLRiCTKKVLl^SFLVSLGIECTQLlLDFFFDF^ 

Z 

20 

MKiPLLTFARHKFVYVLLTLmJVLVYRDVL^ 

VLLGEH3YTELK>0CVLR1^IGREVSYQGLKRKLTLQVASIPCUYLVTVLIIAIITYFFGT 

KSYLmCVLUGIHNAIYFLQiVDYVGNVTRSAnTUrfFLWLGSML^^ 

YlLYrVPYMVLEKYEDNVZ 

25 

MFKVLQKVGKAFMLPIAJLPAAGLLLGIGGAL^NPTTIATYPILDNSIFQSIFQV 
DKGTAAlAGVTGYLVMTATIKALVKLFMAEGSAIDTGVIGALWGrVA 
SSIUGFVFFVIWPPFQQLLVSTGGYISQAGPIGTFLYGFLMRI^GAVG 
30 FAQI^DLAHSGLFTEGTRFFAGRFSTMMFGLPAACLAMYHSVPKNRRKKYAGIJFGVAL^ 
LYVVHAFLDGVSFFIADVUWSIGOTTSGGVIDFTU^ 
EVDSKEISESADSTShTTADYUCQDSLQnRALGGSNMEDVDAC 
AIYGAKAILYKNSINEIU3VDDZ 

35 N0CFRIOACTVl^GAAVUjLAA(^NSGGSKDAAKSGGDGAICTEmvw 
KVIO^IDFKSGPEKITTAXEAGTAPDVLFDAPGRIIQYGK^^ 

SSAPFYMAMNKKMI^DAGVAhfLVKEGWTTDDFEKVUCALKDKGYTPGSLFSSGQGGDQGTRA 
SKYTTDDPKFVKGLEKATSWKDNUNNGSQFDra 
EGKPALEYLVNGFAVFNNKDDKXVAJVSKXHQFUDDKEWGPKDW 
40 YYNTIDGFAEMRTLWFPMU^VShK3DEICPADALKAFrEKANETtKKAMKQZ 

MQSTEKKPLTAFTVISTinXLLTVLFn^FYWTLTGAFKSQPDTrvrp 
SLVTMFLVCATSSLAGYVIJVKKRFYGQRILFAiFlAAMALPKQWL^^ 
45 KQFSENIPTELLESAKHXjCGEIKTFWSVAFPIVKPGFAALAiro^ 
TNYGUMAGAALAAVPIVTVFLVFQKSFTQGITMGAVKGZ , 

MKIMFKNFNMLi^RKIVUXRIVLMMIUNHLI^AVQKQ 
50 LSVlLLESDUDHYHHLmYQSSSFTOYTRKRLVVISKFFTQDU^ 

AUSAGAGFSFFLYFLAFVGQEWMMDHIVTVYLVI.LSUVMLIVSRLEEKFKKGZ 

MGKGEMGKGVIGLEFDSEVLVNKAITXQLANGKTATFLTQY 
55 ARVPGGVNGSKAAVHEVPEFTGGVNGTEPAVHEIAEYKGSDSLVTLTTKKDYTYl^ 
LTAFFLG LFTLGKXREQZ 

60 MKKTFFLL\aGLFCliJ>lSVFAmFKINSYQGDLYl 

QAAKNGAELADVTSEVTEEADGYTVRVYNPGQEGDtVEVDL\nVNl^ 

GDKGAEKLFFHTGKLFREGTIEKSNLDYTIRLDNLPAKRGVEL^ 

DQSKQLVTWVLPSlLSlSLIXSVanrnYRRXTTPSVKYAKNHRLYE 

W3UQATLLDVIDRGNVSIISEGDAVGUU,VKEDGI^FEKDC^^ 
65 [QARGLQUCSSFEEVLNQMQEGVRKRVSFWGLPDYYRPLTGGEKALQVGMGALTU.Pt^GFGU^YSLOVHGYLYLPL 
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PllXJFLGLVI^VFYYV/KUU.DNRDGVL^ 

VSHL^VHQIQVENPDINLYVAYGWHSTFYHSTAQ^HYASVANTAS^ 

5 MKKVRKIFQKAVAGLCCISQLTAFSSlVALAETPETSPAIGKWfKE^ 
EAIFSNIXPGTYTLTEAQPPVGYKPSTKQWTVEVEKNGRT^ 
EKNGQHKAI^PNPYERVIPEGTI^KRIYQVNNLDDNQYGIELTVSGKTV^ 
RRAEJUGEATRSLIDKrTSDSENRVALVTYASTIFD^ 
T^KNDrVELKNKVPTEAEDHDGNRLMYQFGATFT^ 
1 U FAPSYQNQU^AFFSKSPNKDGUXSDFntjATSGEKnVRGDGQSYQMFTDKTVYEKGAPA 
VIGDPiNGGYIWUWRESlLAYPFNSOTAKn^ 
KPENYTNVTinTFCILEQLNRYFHTIVTEiaCSlENCT 

qndggixknajcvlyixrtekrirnn'glyun'dekvtltynvrl^ 
rdvrkypeitiskekklgdiefikvnkndkkplrgavfslqkqh^ 
i j kyrl^nsepagykpvqnkpivafqivngevrdvtsivpqdipagyeftndkhyrrnepippkreyprtgglgml^ 
cmmmggvu.ytrkhpz 

mksinkfltmlaallltasslfsaatvfaactttttsvtvhku^ 
zu vmfwt>nrnneiidengqtlgvnidpqtfkl^gampatamkxlteaegakfnta^ 

tltgskavp1eieu>lndvvdakvypknteakp!adkdfkg 

tanwsdrmteclafnkgtvkvtvddvaleagdyaltevatgfdl^ 

vevpesndvtfnygnnpdhg^kpnkpnengdlt^^ 

tvtvngld&nteykfversikgysadyqentagelavknwkdenpkpldct 
15 vlanadnagqylawcadkvsqeekqlvvttkdaldravaaynaltaqqqtqqekekv 

ewvadkdnenvvklvsdaqgrfeitgli^gtyyleetkqpagyaixtsrqkfevtatsysatgqgieytagsgkddat 

kwnkkitipqtggigtnfavagaalmgiavyayvknnkdedqlaz 

30 mtmqkmqkmisriffv^lcfslvwgahavqaqedhtlvujlenyqewsqlk 
qrvrdlhswdenklssfkktsfemtflenqlevshipnglyw 
ttkvkukvdqdhnrlegvgfklvsvardvsekevpugeyrysssgqvgrtlv 
agyavttldtdvqlvdhqlvtttvvnqklprgffvdfmkvdgrtntsu^amfkvmkek 
gkdgrfrvegleygtyylwelqajtcyvqltspvsmckdtrkelvt^ 

35 VWZ 

MSHIYLSlFTSLU^MLGLVNVAQADEYLRiGMEAAYAPFNWTQDDDSNGAVKU^^ 
PLVVKTKWEGLVPA1/TSGKIDM1IAGMSIH-AERKQEUF 
40 YNLUQIPGAKJCETAMGDFAQMRQALEAGVIDAYVSERPEALTAE^ 
ISQINASIETOKDDQVALMDRMLKEQPAEATTTEETSSSFFSQV^ 
TAPUEMCVrYGl^KLVGWVLNVYffilFRGTPMIVQSMVrYYCT^ 
DKGQFEAATALGMTHN<FMRK1VLPQVVRNU.PATGNE^^^ 
IYFVLTFTVTR1LRFIERRMDMDTYTTGANQMQTEDLKZ 

45 

MTQAJLEKHLKKSYGQ^VUCDISLTVHKGEV 
LGMVF<^FNLf^Nl^LEhrnVAQTTVUCR£RTEAEKlAi^ 

PDAILFDEPTSAI^PEMVGEVLKIMQDLAQEGLTMIVVTHEMEFARDVSHRVIFMDKGVIAEEGKPEDLPTNPKEDRTC 
5y EFLQRYUCZ 

MKKYQLU r KJSAVFSYXJTVFSl^QLTLrVQNYW^ 
YSlLTVLVLVFQlSFhTVCjrAKHVQSTAEGWAVUGYSGTN^ 
JJ LPSILFALPHFSSU^LLDl^FATVGIIFAGLTRYTKSlYPSYAVHVINNIVATFPFLLTFtHRVLGZ 

MNKKQWU31^LVAVAAVGI^CGNRSSRNAASSSDVKTKAAIVTDTGGVDDKSFNQSAW 
GFTYFQSTSEADYANNLQQAAGSYI^IFGVGFALNNA^ 

GVAAAKTTKTKQVGFVGGIESEVISRFEAGFKAGVASVDPSIKVQVDYAGSFGOAAKGKTIAAAQYAAGADIVY^ 

GTGAGVFAEAKSLNESRPENEKVWVIGVDRDQEAEGKYTSiU)GKESNFVL^ 

IVYSLKDKGVDLAVTNLSEEGKKAVEDAKAKILDGSVKVPEK2 
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MSKKI^QBVPUSVFLGIIXGAIVMWIFGYDAJW 
PGQAlj\GWlUGWFAUHPDMPRPLMII^TIVULlAGGrVG^^ 
FMQSTDSTIRVGANATYQTPWLAELTGNSRMNIGIFFAIIAVAVlWFMLiCKTTL^ 
SMOSGAI^GLCKjAVECU^NVYVQGSSUIGFNGMA VSU^^ 
VTASUFFVSVHYLIERFVKPKKQVKGGKZ 

MGVKKKUCLTSLlX}LSIXIMTACATNGVTSDn , A£SADFWSKLVYFFAS 
MVASRXMQEAQPRIKAUIEQYPGRDMESRTKLEQEMRX^ 
FtWINLGSVDTn-V^aAAVFTFl^TWUNKAUERNGATTAM 
Fl^NPFKnAEREAVVQAQKDLENRKRKAKKKAQKTKZ 

MVIDPFAiNELDYYLVSHFHSDHIDPYTAAAlUWPKLEHVKnGPYHC^ 
AVESFDRTCLVTIJVNGADETGCELAGI^VTDEEMAQKAVNYIFETPGGTIYHGADS 
YGENPVGlQDKMTSIDLLRMAE^RTKVriPVHYDIWSNFMASTNEILELWKMRKDRLQYDFH 
QHLVEYHHPRGFDDCFEQDSNIQFKALLZ 

MFUGWUSFAhTTYIHDLU3VlFPDSPFLNAFESAlAAPL^^ 

GY1RTDLPEGFDFTISRU.ERIISGIASHWTFSGLAWGVYLLYRATC 

PLAIPVVTA1ALYGFYHAYCFVEKHNELMT2 

MKVEPRCDVl^RMSHFHRILIMELQELVERSWAIRQAYHEUVKHHDSKW 
YDETPYTLEQKLSENIWWLLELSQRLDIDILTEMENFLSDKEKQLNVRTWKZ 

MUJWKQFFl^YUWRSRUTYLI^LAFLVUJHJFLFASLGIYFLYFFFL^ 
SPLEIAl^EKl^R£MELYWR5KA£RKLTDLLDYYTLWVHQaCTP!AASQLLVAEVVDRQLK 
LQYUU.ESFHDDLVUCQVQIEDLVKEHRKYALFHQKGLWNITO 
^DQELCIKOTGIGIKNSDVUtVFERGFSGYNGR^ 

MELNTHNA£tLLSAA^HYPQDEU>EWI^GRSNVGra 

GYGYARVSKKEREKWGCMlEEYLTTRENLRAVVSLVDUlHDPSAOOVQMYEFUCYYElPVlIVATKADiaPRGKW 
ESAIKKKLNFDPSDDFILFSSVSKAGMDEAWDAILEKLZ 

MTXKQLHLVrVTGMSGAGKTVAIQSFEDLGYFTTDNMPPALLPKFLQLVEIKED 
ENQDGLDFKILFUDAADKELVARYKETRRSHPLAADGRILDCK^ 

SDQEQAQSFRiEVMSFGFKYGIPlDADLVFDVRFLPNPYYLPELRNQTGVDEPVYDYVMNHPESEDFYQHLLALIEPILP 
SYQKEGKSVLTIAMGCTGGQHRSVAFAKRLAQDLSKNWSVNEGHRDKDRRKETVNRSZ 

MRKPKrTVtGGGTGSPVtUCSUlEKDVEL^AlVTVADDGGSSGELRKN^ 
FSEDAGAFAGHPLGWJIAGLSEMC^STYNAMQIXSKFFHTTGKIYPSSDHPLTLKAVFQDGT^ 
VYVTNALNDDTPLASRRVVQTILESDMIVLGPGSUTS^^ 
EVLHRHLGRPRDTVLVMEKVPQEYMNSNRTOEYLVQVEH^ 

IQVKKZ 

MKNUmJIRUVNl>DSVFYIVALWHVSNNYSSSMFLGlF^ 

UXNQlSIWlMSLVFISVMASSlSYVIEDVUPQVVEYDm^ 

LALFILL1IJCFRTSNANIENFSFKYYKREVLQGTKFILNNK 

GLGGILGNMtAPIVlKYUCSNQlVGVFLFLNGSSWLVAMro 

TTIDSIISFGMPIGSLVAGTUDLMELVUAISIPYFIFSYIFYTDNGLKEFSIYZ 

MMShnCWCEILIFAILYTVLFMFDGVKliASl^PSAIANYLVYVVlJVLYGSFLFKDRU 
t^LMTVVreFVSEMUCQFVGLDGQGLNQSNlQSTF^ 

GLVFALTHMHSLALSEWlGAVGYLGGGLAFSUYVKEKENIYYPUVHMLSNSLSLIILAiSIVKZ 
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UOCPIIEFKNVSKVFEDS^VL^ 

TVF(£YAU?HMNVFENVAI?LIUJUero^ 

EPLSAU5LKUni>MQYEU^LQQRLG!TFVFVTO 

ILPGTMIEDYLVEFNGKRFEAVDGGMKPNEPVEVVIRPEDLRrr^ 

HSTRKAlVGEEIGU3FEPEDIHIMRU^ErEEEFDAiUEEYVEIEEQEAGU^ 



^MRILRL^UQISI^CFLWKECILSFKQSTAFFIGSM^ 

LLGAVLmDNLFIKhnXQQELVDFUJSFFFl^^ 

LIYIGLGLFFAAYLVPNSVKKEVSFYGHIFRDLVLVIVTLIFFZ 



MVKKIIGMVUUXS\TWGVGVFAYm(^CrrETLAKTY^ 

MILMTVNPKTKKTTMMSLERDILTRIESGN^^ 

DAVCCrTVNNtlXIFPlSlSDQEEFNTISIGVGEQHICGEEALVYAR^^ 

YQEIUCALSDNMQTNIDLSAKSU*NLlXiYXDSFKTIETQ^ 

ETNAVLFEDLFGRAPVGDEDN2 



N^KKQAYV!LALTSFIJ ; VFFFSHSLLEIUDFDWS[FLHDVEKTE 

RLLAGQEVVQVADPDLDASFKSl^GKIJ^tXTEALQKAE^ 

QAIiaDREKMQTQLQS^AILETAQKDLRVtXI^LRPVEl^ 

HULQELlShrTUUUQASCLDVYLYQTDVELQUVVD^^ 

VDIRIPLLOKE2 



N^SUSQG FVWAlLGLGIFMTFRII^PDNfTTEGSFPLGGAVAVTUTC 

IPTLLSGILVMTSCHSIMLLIMGRA^GLLGTKQIQ^ 

OMARS FGIHTGRMElJrtGLVLSNGVtAI^GALUQQEGYADVSRGIGVIW^ 

QFLVWaVIALGFNTTSYLRLYSAU1j\VCLMI1TTKQTIUCGAKLSKZ 



MKKMKVWSTVLATGVALTTT-AACSGGSNSTTASSSEE 
AQLADRVtAEKNNAVADMVFGIGAVDS^RlXJKIXVQYKPKWLDKiDQ^ 

KDWTELGSKYKGKYS1SGLQGGTGRAU^ILVRYLDDKGELGVSEKGWEVAXEYUCNAYTLQKGESSIVKMU)KEDP! 

QYGMMWGSGALVGQKEQNVVFKVMTPEIGVPFVTEQTMVl^SKK^ 

AUCDLPEDTKKFVDQVKPQNIDWEAVGKHLDEWVEKAELEYVQ2 



MIKFDNIQIKYGDFVAlDNLNLDlHEGEFFTTLGPSGCGKSTn.RALVGFU>PSSCSIEVNGTDVTHLEPEKRGIG 

ALFPTMTVFDNLAFGLKVKKVAPDVIKAXVSAVAAJCIKISDQQL^ 

NLDAKUlVDLRiCEUCRLQKEIXirraYVTHI^ 

ETVHEVIXKNTCVFL ED KKGYIRLEKVRFNRETEQDRUCGTIID^ 

ELFTTPSDVLQFZ 



MRHKLNUCDWLIRUJUWFLVTHr^PNFDLVVNVFVKGGEFSLDAVHRVLKS 

CVUT'EYFDKGAJaLKUjYNroLTYGGWI^TG^ 

LFLT>niRSVDYKnEAARNMGAJCPFrVFRKVVLPTLfPTU : ALTIMVFLSG 

NSRDl^LLAULGUTTaLTTMMOEKGGNYlS^ 

AK^NLTLSNFTUNYRUTSNSAAraPFLVSFIYSIU^ 

VSUJTFNQPQFLVU^QILVGSLVILLIAYIVVKIPFSYRM^ 

VlALNFWU.TDFDI^VFLYHPLAQPLGrriRSAGDETATSNAQALVFVYTrVUMIISGTVLYFTQRPGRKVR^ 
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Table 3 

ID201 ♦ 4106.4 

ATGATAAAAAATCCTAAATTATTAACCAAGTC T T ri^TAAGAAGTTTTGCAATTCTAGGTGGTGTTGGTCTAGTCAT 

tcatatagctatttatttgacctttcctttitattatattcaactgg;^^ 
tgtttacggagtatttaaagactaagacatctgatgaaattccaagcttactccagt 
atatctgctcaccttaaaagagatattgtagataagcggctccctcttgtgcatgacttggata^ 
gctatcaaattatatcgtgatgttagatatgtctgttagtacagcagatggtaaacacgtaaccgt^ 

ACGGGGTGGATGTCTACAAAGAAGCAAAGAATATTTTGCTTTTGTA 

TTTTCli i ivji i 1 1 1 iU A i ATrTriA TACTA AACGCTTGCTCAATCCTCTTTT^ 

AATG CAAGATTTGGATGACAATATTCGTTTTGATGAAAGTAGGAAAGATGAAG 

ATGGTATGTATGAGCACTTGTTGAAGGTTATTTATGAGTT 

CAAAAGGTTTCCTTTCTCCGCGGAGCATCACATGAGTTGAAAACCCCTTT^ 

TATGCAGCATAATATTGGAGATTACAAAGATCATCCAAAATATATTGCAAAGAGTATAAATAA 

GCCACTTATTAGAAGAAGTACTGGAGTCTTCTAAA7TCCAAGAGTGGACAGAGTGT 

CCAGTTTTAGTAGATATTTTATCACGTTATCAAGAATTAGCT 

AGATGCTACCAGGGTCGTCATGAGTCTTAGGGCATTGGATAAGGTTT^ 

ATTCAGATAAAAATGGGCGTGTAATCATATCCGAGCAAGATGGCTATCTCTCT 

AGTGACCAAGAACTAGAACATTTATTTGATATATTCTATCATTCT 

TTTGGGTCTTTACATTGTGAATAATATTTTAGAAAGCrATCAAATGGATTATAff 

TGGAATTTAAGATTAGCTTGTAG 



25 MI KNFKLLTKS FLRS FAI LGGVGLVIHI AI YLTFP FYYIQLEGEKFNESARVFTEYLKTKTSDE I PSLLQSYSKSLT 

ISAHLKRDIVDKRL?LVHDU}IKDGKLSNYIVMLDMSV^ 
FSFVFSYFYTKRLI^PLreiSEVTSKMQDLDDNIRFDESRKDEVGEV^ 

QKVS FVRGASHELKTPLASLRI ILENMQHNIGDYKDHPKYIAKS INKIDQMSHLLEEVLESSKFQEWTECRETLTVK 
PVLVD I LS RYQELAHS I GVT I ENQ LTD ATR WMS LSALDKVLTNL I SNAI KYSDKNGRV 1 1 SEQDGYLS I KNTCAP L 
30 SDQELEHLFDIFYHSQIVTDKDESSGLGLYIVNNILESYOMDYSFLPYEHCMEFKISL2 

;P?02 - 4X06.9 

ATGGATAAAATTATTAAAACTATATCAGAAAGCGGAGCCTTTCGTGCTTTTGTCCTTGATAGC^ 
35 CACTGCTCAAGAAAAACAT CAAAC C CAAGCT AG CT CAACTGT AG C GCTTGG7 C GAACTCTTATCGCT AG C CAG ATT C 

TCGCAGCCAATGAAAAAGGAAATACCAAACTTACAGTTAAGGTGTTGGGATCTAGCTCTCTAGGTGCT 

GTCGCTGATACCAAGGGGAACGTCAAAGGCTATGTTCAAAATCCTGGTGTTGACATCAAAAA^ 

AGTCCTAGTCGGACCTTTTGTTGGAAATGGTCAATTCCTCGTTATCACAGA 

CTATAACTCCCCTCATCTCTGGAGAAATCGGTGAAGACCITCCCTTT^ 
40 GCGGTCGGCCTCAATGTCCTTTTGGACGAGGAAGACAAGGTCAAGGTTGCAGGTCGTTTC 

AGGAGCCAAGAAAGAAGAGATTGCTCGCTTTGAAAAACGCATCCAAGAAATGCCAGCTATCTCTACTCTTCTC 

GCGACGACCATATCGAAGCCCTCCTCAAGGCTATCTACGGGGACGAAGCCTACA^ 

CGTTTCCAATGTGACTGTAGCCATGAACGCTTTATGAACGCTCTTGCCAGC 

GAAAG AGGAAGAC CACGGGG CAGAAAT CACTTGTCAATT CTG C CAAACTACTTACAACTTTGATGAAAAGGAC CTGG 
45 AGGAACTCATTCGTGACAAATCTTAA 



^KIIKTISESGAFTlAFVLDSTETVRTAQEKHQTQASSTVALGRTLIASQIlJU^NEKQrrKLTV 
VADTKGNVKGWQNPGVDIKKTATGEVLVGPFVGNC^ 
50 AVGLNVtiDEEDKVKVAGGFTiVQVLPGAKKEEIARFEKRIQEM^ 

RFQCDCSHERFMNALASLPSSDLQEMKEEDHGAEI TCQFCQTTYNFDEKDLEELI RDKSZ 



- 

ATGAAATCAATAACTAAAAAGATTAAAGCAACTCrTGCAGGAGTAGCTGCCTTGTT^ 
TGTATCTGCTCAAGAATCATCAACTTACACTCTTAAAGAAGGTCATACACTTTCAGAAA 

CAACAG TT G AAAAATTGG CAG AAAACAAC C ACATT G AT AACATT CATTTG ATTT ATGTTGAT CAAGAGTTGG TT AT C 
GATGGCCCTGTAGCGCCTGTTGCAACACCAGCGCCAGCTACTTATGCGGCACCAGCCGCTO\AGA 



5 
10 

15 
20 
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AGCT CCAGT AGCAGAAACT C CAGT AG T AAGTGAAACAGTTGTTT CAACTGT AAGCGGATCTGAAG CAG AAGCCAAAG 
AATGGATCGCTCAAAAAGAATCAGGTGGTAGTATACAGCrACAAATGGACGTTATATCGGACGTTACC^ 



MKSITKKIKATLAGVAALFAVFAPSFVSAQESSTYTVKEGDTI^EI^ 

DGPVAPVATPAPATYAAPAAQDETVSAPVAETPVVSETVVSTVSGS EAEAKEW I AQKESGGS IQLQMDVI SDVTNZ 
IP204 - 4117*1 

ATGAAT TTAGGAGAATTTTGGTACAATAAAATAAATAAGAACAGAGGAAGAAGGTT 

TTTTTT AGCT CTGCTATTTTTCTT AGCT AGTCCAGAGGGTC 

ATCTGAAAGAAGATGGCAGTCAAGCAGCAAATGAGTGGGTTTTTGATACT 

GCAGATGCTAACTATGCTGAAAATGAATGGCTAAAGCAAGGTGACGACTATTTTTACCT 

GGCCAAATCAGAATGGGTAGAAGACAAGGGAGCCTTTTATTATCTTGACCAAGA 

GGGTAGGAACTTCCTATGTTGGTGCAACAGGTGCCAAAGTAATAGAAGACTGGGTCTATGATTCT 

TGGTTTTATATCAAAGCAGATGGACAGCAC3CAGAGAAAGAATGGCTCCAAATTAAAGGGAAGGACT 

ATCCGGTGGTTATCTACTGACAAGTCAGTGGATTAATCAAGCTTATGTGAATGCTAGTGGTGCCAAAGTACAG 

GTTGGCTTTTTGACAAACAATACCAATCTTGGTTTTACATCA 

TTGGTTTTATCTCAAATTTGATGGGAAMTGGCTGAAAAAGAATGGGTCTA 

ACTTCAAATCCGGTGGTTACATGAC^GCCAATGAATGGATTTGGGATAAGG 

GGGAAAATAGCTGAAAAAGAATGGGTCTACGATTCTCATAGTCAAGCTTG 

C^CAGCCAATGAATGGATTTGGGATAAGGAATCrrGGTTTTACC 

GGGTCTACGATTCTCATAGTCAAGCrTGGTACTACTTOVAATCT 

GGTTATCAGCTTGGAAGCGATGGTAAATGGCTTGGAGGAAAAACTACAAATGAAAATG 

GCCTGTTACAGCCAATGTTTATGATTCAGATGGTGAAAAGCTT^ 

TTACAAGCGCTAGATGCTAGTAAGGACITTATCCCTTATTATGAGAGTGATGG 
TCAGAATGCTAGTAT^CAGTAGCTTCTCATCTTTC^ 

TGCATTTTGATGGTTTTAAGCTTGAGAATCCL t-L^"riTCAAAGATTTAACAGAGGCTACAAACTACAGTGCTGAA 
GAATTGGATAAGGTATTTAGTTTGCTAAACATTAACAATAGCCTTTTGGAGAACAAGGGCGCTA 
CGAAGAACATTACCATATCAATGCTCTTTATCTCCrrTGCCCATAGTGCCCTAGA 
TTGCCAAAGATAAGAATAATTTCTTTGGCATTACA^ 

GATGTGGATAAGGGAATTTTAGGTGCAACCAAGTGGATTAAGGAAAATTATATCG 

AAACAAGGCTTCTGGTATGAATGTGGAATATGCTTCAGACCCT^ 

AAATCAATGAGAAGCTAGGTGGCAAAGATTAG 



MNU3EFWYNKINKNRGRRLMKKVRFI FLALLFFLAS PEGAMASDGTWQGKQ YLKEDGSQAANEWVFDTHYQSWFYI K 
ADANYAENEWLKQGDDYFYLKSGGYMAKSEWVEDKGAFYYI^ 

W FY I KADGQHAE KEWLQ I KGKDYYFKSGGYLLTSQW I MQ A YVNASGAKVQQGWLFDKQYQSWFYIKENGNYADKEWt 
FENGHYYYIJCSGGYMAANEWIWDKESWFYLKFDGKMAEKEWVYDSHSQAWYYFKSGGYMTAN^ 
GKIAEKEWVYDSHSQAtWYFKSGGYMTANEWIWDKESWFYLKSDGKIAEK 
GYQLGSDGtofLGGKTTNENAAYYQWPVTAJfVYE^ 

LQALDASKDFIPYYESDGHRFYHYVAQNAS I PVASHLSDMETVGKKYYSADGLHFDGFKLENPFLFKDLTEATNYSAE 
EU)KVFSLLNINNSUJEKKGATFKEAEEHYHINAI*YLLAHSALES 

DVDKGILGATXWIKENYIDRGRT FLGNKASGMNVEYASDPYWGEKIASVMMKINEKLGGKDZ 
ID2Q5 . 4118.1 

ATGAAAAAATTAGGTACATTACTCGTTCT C Tr TC ri ' r CTGCAATCATTCTTGTAGCATGTGCTAGCGGAAAAAAAGA 

TACAACTTCTGGTCAAAAACTAAAAGTTGTTGCTACAAACT 

ACAAAATTGACCTTCATAGTATCGTTCCGATTGGGCAAGACCCACACGA^ 

AAAACTTCTGAGGCTAATTTGATTTTCTATAACGGTATCAACCTTGAAAC^ 

GGTAGAAAATGCCAAGAAAACTGAAAACAAAGACTACTTCGCAGTCAGCG^ 

GTCAAAATGAAAAAGGAAAAGAAGACCCACACGCTTGGCTTAACCTTGAAAACGGTATTATTr^ 

GCCAAACAATTGAGCGCCAAAGACCCTAACAATAAAGAATTCTATGAAAAAAATCTCAAA 

AGACAAACTTCATAAAGAAAGTAAGGATAAATTTMTAA^ 

GAGCATTCAAATACTTCTCTAAAGCCTATGGTGTCCCAAGTGCTTACATCTGGGAAATCAATA 
ACTCCTGAACAAATGAAGACCTTGGTTGAAAAACTTCGCCAAACA^ 

GGATGACCGTCCAATGAAAACTGTTTCTCAAGACACAAACATCCCAATCTACGCTCAAATCTTTACT 
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CAGAACAAGGTAAAGAAGGCCAOW^ 
AAATAA 



5 MKKLGTLLVLrLSAI I L VACASGKKDTTSGQ KLKW ATNS X IAD ITKNIAGDKI DLHS I VP IGQDPKEYEPLPEDVK 

KTSEANLirYNGINLETGGHAWFTKLVSNAKKT£NXDYFAVSIXJVD 
AKOL£AKDPNNK£FYEKNLKEYTDKLI)KLDK^ 

TPBQ I KTLVEKLRQTKVPSLFVESSVDDRPMKTVSQDTNIP I YAQIFTDS I AEQG KEGDS YYSMMKYNLD KI AEG LA 



ID?0S - 41^.; 

ATGGAATGGTATAAAAAAATCGGAC7TCTTGCAACTACAGGTTTAGCTTTGTTTO 

TGGTAAATCTGCGGATGGCAGVTCGACCATCGACTATTTCAACCAGAAA 

TCACTCGTGAriTTGAGAAGGAAAACCCTAAGATCAAGGTCAAAGTCGTCAATCT 
1 5 AAGACACGCGTTCTCGCAGGAGATGTGCCTGATGTGGTCAATATTTACCCACACT 

AAAAGCAGG7GTTTTTGAAGATTTCAGCAACAAAGACTACCTGAAAOT 

CTGTAAACGAAAAAGTTTACAACGTTCCrrTTTACAGCT 

GAACn^GCTTGAAGGTTCCTGAAACCTGGGATGAATTTGAACAGTTACT 

AC(^TTTGGAATTGCAGGTGCAGATGC7TGGACACTCAATXXj 
20 GAGGAAAAGAAGCAAATCAATACCITCGTTATTCTCAACCAAATGCCATTAAATTGTCGGA^ 

GATATCAAGGTCATGGACATCCT7CGCATCAATGGATCTAAGCAAAAGAACTGGGAAGGTC 

TATCGGAGCCTTCGCACGTGGGGATGTCCTCATGACACCAAATGGGTCTTGGGOjATCAC^ 

AACCGAACTTTAAGATTGGGACCTTCATGATTCCAGGAAAAGAAAAAGGA 

TTGGCATGGTCTATCTCAGCCACCACCAAACATCCAAAAG^ 
25 AGTCATGCAAAAATACTACXIATGTGGACGGATCTCCAACAGCGATCGAAGG 

CGCTTGCTGGTATGACCGAATATGCCTTTACGGATCGTCACTTGGTCTGGTTGCAACAATACT 

GACTTCCATACCTTGACCATGAACTATGTCTTGACCGGTGATAAAOIAGG 

TAACCCGATGAAAGCGGATGTGGATTAG 

30 

MEWYKKIGLLATTGIALFX3IX3ACSNYGKSAIXrrVTIEYFWQKKEMTKTLEEIT 

KTRVLAGD V PDWN I YPQS I EWEWAKAG^EDLSNKD YLKRVKNGYAEKYAVNE KVYNVP FTANAYGI YYNKDKFE 
ELGLKVPETWDEFEQLVKD I VAKGQTPFGIAGADAWTLNGYNQLAFATATGGGKEANQYliRYSQPNAI KLSDP IMKD 
DIKVWDILRINGSKQKNWEGAGYTDVIGAFARGDVI^ 
35 LAWS I SATTKHPKEANAFVEYMTRPEVMQKY YDVDGS PTAI EGVXQAGEDSPLAGMTEYAFTDRHLVWLQOYWTSEA 

DFHTLTMNYVL7GDKQGMVNDLNAFFNPMKADVDZ 

ID207 ♦ 4123.1 

ATGAAGAAAATCAAACCGCATGGACCGTTACCAAGTCAGACTCAGCTAG 
40 TATCCACTTCGGTCCTAATACCTTTTATGACCAAGAATGGGGGACTGGACAGGAGGA 

GTCAGTTGGATGCGCGTGAGTGGGTTCGTGTCCTCAAGGAAAC 

CAC GATG GCTTTGTCCITTATCCGACAGCTCACACAGATT 

CGACTTGCTCCTTGAAGTATCCCAAGCTGCCACAGAGTTTGATATGGAT^ 

CCCATAGTCCCCTCTATCATGTGGACCGAGAAGCGGACTACAATGCCTATTATCTGGCTCAGTTG 
45 TCAAATCCTAACTATGGGAATGCTGGTAAGTTCGCTGAGGTTTGGATGGATGGTGCCAGAG 

GGTT AATT ATGAATTTGAAAAATGGTTTGAAAC CATT CGTGAC CTG CAGGGCGATTGCTTGATTTTTT CAACAGAAG 

GCACCAGTATCCGCTGGATTGGCAATGAACGAGCCTATCCAGGTGATCCACT 

CTAGGAAC^GAAGCAGAGCTGAACTATCrrCAGCACG 

TGTTTCCATCCGTCCAGGCTGGTTCTACCATGAGGATCAGGATCCT 
50 TTCACTCAGTAGGGCGAGGAACTCCACTCTTGCTTAAra 

ATTGAACGACTTTATGAATTTGCGACCTATCGCAATC^GCTCT 

TGGTCCAGCTCTTTCCGCAGACTTTGCTTGTCGCCATTTGACAGACGGCC^ 

CAGACTTGCCCATCCAGTTAGAACTCGACTTAGGTTCTCCTAAAACTTTT^ 

AAGCTAGGGCAACGAATCGCTG C lTri'CATGTGCAAGTAGAGGTGGATGGTGTCTGGCAGGAGl"l"l C GGGTCA 
55 TAC TGTT GGTTACAAACGTCTCTTACGAGGAGCAGTTGTTGAGGCACAGAAGATACGTGTAGT 
AGGCTTTGCCTTTGTTGACCAAGATTTCCCTTTATAAAACTC 
GCATTTGCAGAAAAAAGCCTAGCTGTGGCAAAGGGAGAAAATGCCTATTTTACAGTT 

T C CTTT AGAAGCT AAG ATTTCGATTCAACCGGGGAGAGGTGT CCATGGTG T CGC CT AT CAGGATGAGATTCAAGTCC 
TTGCGTTTCAAACTGGTGAGACTGAAAAAAGTCTGACGCTACCAACCTTGTATT^ 
60 TTCTATCTGAAC CT AACGGTGGATGGTCAGCTTGTGGATCAACTTCAAGTC CAAGTTTCATAA 
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MKKIKPHGPLPSQTQLAYLGDELAAFIHFGPNTFYDQEWGTCQEDPE 
HDGFVLYPTAHTDYSVKVSPWRRGKGDLLLEVSQAATEFDMD>KjVYLSPWD^ 
SNPOTGNAGKFAEVWMDGARGEC^KVNYEFEKWFETIRDL^ 
5 UjTEAELCTYLQHGDPSGTIFSIGEADVSIRPGWFYHEDQDPKSLEELVEI^ 

I ERLYEFATYRNELYKEDLAIX»A£rVSGP W^ADFACRKLTDGLETSSWASDAD PKTFOVI ELREDL 

KLGQRIAAFHVQVEVDGVWQEFGSGHTVGYKRLLRGAWEAQKIRWITESQA^^ 
AFAEKSLAVAKGENAYFTVKRRECSGPLEAKISIQPGTGVHGVAYQDEIQV^ 
F YLNLTVOGQL VDQLQ VQVS Z 

10 

ATGCTTCAAAGACTGAAAAC^TACATTATATGTTTTC^ 
AGTGACTGGGTGGCTTTCTGCCTGGCATTTATTGATTGATATTCTA 
CTAAGAGCCAGCGCCTATCTTGGCTATATTGGGGCCTCATC 
1 5 GTTAATTATATCTG Cirrrrrcn ' nU 'CCTATCCAATCTCTTAAGTTATCATTTCAGCGTACGTAGTTTAAAGTCTTT 

ACATGTCTGGA CriTl ' ClTLTlU CTCAAGTCCTTGTTGTGGGGCA^ 
■ TTCTATTCTATCTACTTGTAATTCTT ^ 

GATTTGAAAGAAGCTCAGGTCAAGCAAAATGCTCAGATAAATCTATTGCTTGCTGAAAATG 
TCAGGATTTGCATGATAGTCTGGGACATACCTTTGCTATGCTGAGTGTCA 
20 AGATGGAGGCTTATCCACAGGTGGAAAAGGAATTAAAAGAAATTCACCAGATCAGCAAGGATCCATGA 



MLERLKRI HYMFW ISLIFMIFPI LSWTGWLSAWHLLID I LFWAYLGVLTTKSQRLSWL YWGLMLTYWGNTAFVA 
VjmWFFFFLSNLLSYHFSVRSLKSUiVWTFLLAQVL^ 
25 DLKEAQVKQNAQim^IAENERSRIGQDLHD^^ 

- 4126.3 

ATGAATGATAAGTTAAAAAT CTl ' L ' ri GTTGCTAGGAGTA rr 1 ' lTr CTACCCATAACCGGTTTCTATGTTCTATTGAT 
ACGAAATGCAGGGCAGACAGATGCCTCGCAAATTGAAAAGGCGGCAGTTAGCCAAGGAGG^ 

30 CAGAAATTAGTAAAGACGCAGACTTGCACGAAATTTATCTAGCTGGAGGTTCTTTCTC 
TCACGTGTTCCCGGGGTGACGGATGCCGTTTCAGGCTATGCAAATGCT^ 
TAACCAAACAGGTCATGCAGAAACCGTCCATGTCACCTATGATGCCAAGCAAATTTCT 
ACTATTTCCGCATTATCAATCCAACCAGCAAAAATAAACAAGGAAATGATGTCGGGACCC 
TATTAGVCAGATGACAAGGATTTGGAAGTGATTAACCAAGTCTTTGATGAGGTGGCT 

3 5 AGCAGTTGAAAAGGAAAACTTGAAGAATTTTGTGGTGGCTGAGGATTACCATCAAG^ 
ATGGCTACTGCCATATCAATCTTAATCACGCGGCCTATCCTGTCACT 
GAGGAATTGAAAAAGACCCTGTCACCTGAGGAGTATGCAGTTACCCAGGAAAAT 
CCGTTACTGGGATAAATTTGAATCCGGTATCTATGTGGATATAGCAACTGGGGAACCTCT 
AATTTGAGTCTGGTTGTGGCKWCCTAGTTTTACCCAACCCATCACT 

40 TCCTACAATATGACGCGTATGGAAGTGCGGAGCCGAGTAGGAGATTCTCACCTTGGGCATC 

ACAGGACAAGGGCGGCTTACGTTACTGTATCAATAGCCTCTCTATCCGCrTTATTCCCAAAG^ 
AAGgcTACGCTTATTTACTAGATTATGTTGATTAA 



45 MNDKLKI FLLLGVFFLAITGFYVLLIRNAGQTDASQ I EKAAVSQGGKAVKKTE ISKDADLHE I YLAGGCFWGVEEYF 

SRVPGVTDAVSG YANGRG ETTKYE L INQTGHAETVHVTYD AKQ I S L KE I LLHYFR I INPTSKNKQGNDVGTQYRTGV 
YYTDD KD LEV I NQVFD EV AKKYDO PLA VEKENLKN FWAED YHQD Y L KKN PNG YCH INVNQ AA Y P V I DAS KYP KP S D 
EEUOCTI^PEEYAVTQENQTERAFSNRYWDKFESGIYVDIATGEPLFSSKD^ 
SYNMTRMEVRSRVGDSHLGHVFTDGPQDKGGLRYCINSLSIRFIPKDQMEEKGYAYLLDYVD2 

50 

ATGAAAAAGAAATGGATCTATTATGCTGCTTGTTCT 
CGGCGGTTCGCTAGTCGTTTATTCACCAAACTCAGAGGGCTTAATTGGAG 
ATGOTATCAAAGTAGAACTGATTCAAGCTGGTACTGGAGAACTTTTCAAAAAA 
55 GTAGCT^TCTTATCTTTGGTGGTTCTTATACACAATATACTACCCACGGAGAACT 

AGAAAATGATAATGTTATCAAAGAATATCAAAACACAACTGGCTACTCTACTCCTTATACACTAGATGG7AGTC 

TAATCGTCAACCCTGATTTAACTAAAGGCATGAACATCGAAGGATATAACGATCTTTTCAAACCT 
AAAAT CG CAACTG CTGAC C CAGCAAACTCTT CT AGCGC CTTTGCTCAATT AACAAAT ATGCT ACAAC CT 
TTACAAAGATGATAAGGCTTGGTCTTATGTAAAAGAT L ' ri ' rr CACACTTATTGAT 
' 60 CTAGTGTCTATAAAGTAGTCGCTGATGGAGAAATGGCTGTTGGTCTCT 
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AATGACGGAGCTAACATTAAGGTAGTCTATCCAAAAGAAGGAACCCTCTTCCTACCTCCTAGTGCTGCTATOT 
AAAATCTAAAAATATCGAAAATGCCAAGAAATTTATCGATrrTATTATCT 

CAACCACTACTAACCGTCCTGTTCGTAAAAATGCTAAAACAAGCGAAAACATGAAACCAATTGAC^ 

CTCACTGAAGATTATCIATTATGTCATCAAGAATAAATCAGATATCGTTAAGAAATACAACGAAGTC^ 
CCAATCTAAACAGTAA 



MKKKWMYYAACSSNESADDSSSDKGDGGSLVVYSPNSEGLIGATIPAFEEKYGIKVELIQAGTGELFKKLES 

VADVIFGGSYTQYTTHGELFENyrSXENDNVXKEYQtrrTGYSTPYTLDGSVLIVNPDLTKGMMIHG 

KIATADPANSSSAFAQLTNMIXJAQGGYKDDKAWSYVKD^ 

MDGANIKVVYPKEGTVFLPASAAIVKXSKNMENAKXFIDFIISQEVQDTl^TTTTNRP 
LTEDYDYVIKNKSDrVKKYNEVFTDIQSKQZ 

mn - 

ATGAGTGAGATCAAA ATTA TTAACGCCAAAAAAATCTACCACGATCT 

TCCAAAAGGAAGTCTCTTTACCCTTCTTGGAGCTTCAGGATGTGOT 

TG\ACAGTATCGAAGGTGGAGAATTTTACTTCGATGATACAAAAAT 

GGGATGGTTTTCCAAAACTACGCTATTT?^^ 

GAAGGTTCCAAAAGAAGAATTGATTCAACAGACCAACAAGTA 

GAAAGCCCGATAAACTCAGTGGTGGACAACAACAACGTGTCACCTTGGCATGC 

CTCCTCATGGACGAGCCACTTAGTAATCTGGACK3CCAAACTTCGCTTGGATAT 

ACACGAAGTGGGAATTACAACTGTTTATGTAACCCACGACCAAGAAGAAGCCAT 

TT ATGAA AGATGGGGTGATCCAACAAATCGGCCGACCAAAAGAACTCTATCATAAACCAGCT 

ACCTTTATCCKjACGCACAAATATTATCCCTGCCAATCTTGAAAAACGGAGCGACGGC 

TGGCTATGCCCTTCGAATGCCAGCTCTTGATCAGGTTGAGGAGCAAGCTATTCATGTA^ 

TTATCAAAGATGAATCTGGAGATATTGAAGGAACTATTAGAGATAGCGTCTATCTTGGAC^ 

ATTGAGACAGGTTTTGCCTCAAAAATTCAAGTTAGTGAAGAATC^ 

TATTCGTCTACGAATCAATACGCAAAAATTAAACATCTTTTCTGCAGATGGTTCCCAAAACCTGA^ 
ACCATGGAACGTAA 

MSEIKIINAKKIYHDVPVIENLNITIPKGSLFTLLGASGCGKTTLLRMIAGFTJSIEGGEFTFDDT^ 

GMVFQNYAIFPHLTVRDNVAFGLMQKKVPKEELIQQTNKYIXLM^ 

LLMDEPLSNLEAKLRLDMRQAIREIQHEVGITTVYVTHDQEEAMAISDQIAVMKDGV 

TFIGRTNI I PANLEKRSDGAYIVFSDGYALRMPALDQVEEQAIHVS IRPEEFI KDESGDIEGTIRDSVYLGLNTDYF 
IETGFASKIQVSEESTFEEDLQKGNRIRLRrNT'QKLNIFSADGSQNLIKGVNHGTZ 

10212 - 4ia«.i 

ATGAAGAAAAAATTATTGGCAGGTGCCATCACACTATTATCAGTAGCAACTTTAGCAGCTTGTT 

AGGTGCAGACCTTATCAGCATGAAAGGGGATGTCATTACAXIAACATCAAT^ 

CAGCCCAACAAGTCTTGTTAAATATGACCATCCAAAAAGTTTTTGAA 

GAGGTTGATGATACTATTGCCGAAGAAAAAAAACAATATGGCGAAAACTACCAACGTGTCITC 

GACTCTTGAAACACGTAAAGCTCAAATTCGTACAAGTAAATTAGTTGAGTTGGCAGT^ 

AATTGACAGATGAAGCCTATAAGAAAGCCTTTGATGAGTACACTCCAG^ 

AATGAAGATAAGGCCAAACyUVGTTCTCGAAAAAGCCAAGGCAG 

TTCAACTGATGAAAAAACAAAAGAAAATGGTGGAGAAATTACCTTTGAT^ 

TCAAAAAAGCCGCTTTCGCTTTAGATGTGGATGGTGTTTCT 

AGCCAATATTACATTGTAAAACTCACTAAGAAAACAGAAAAATCATCTAATATTGATGACTACAAA 
AACTGTTATCTTGACTCAAAAACAAAATGATTCAACATTTGTTCAAAGC^ 
ATAK^AGGTTAAGGACCAAGCCTTCCAAAATATCTTTACCCAATATA 
AGTACATCAAACGAATAG 

MKKXLUVGAITLI^VATT^CSKGSEGADLI^^ 

EVDDTIAEEKKQYGENYQRVLSQAOfTLETRKAQIRTS KLVELAVXKVAEAELTDEAYKKAFDEYTPDVTAQI IRLN 
NEDKAKEVLEKAKAEGADFAQLAKDNSTDEKTKENGGE I TFDS ASTEVPEQ VKKAAFALDVDGVSDVI TATGTQAYS 

SQYYIVKLTKK7EKSSNIDDYXEKLKTVILTQKQNDSTFVQSIIGKELQAANIKVKDQAFQNIFTQYIGGGDS 
STSMEZ 
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ATGAAAAAAAATATTAAACAATATGTAACCTTAGGTACTGT^ 

AGCTCACKSAGACTGAAACTTCTGAAGTATCAACACCAAAGTTG^ 

AAGTACAACCTACATCGGATAACTCTTCGGAAGTTACTGTACAACCTTO 
5 TCrTCTACACCCGAAGAAACTCCTGTCTTAGAAAAA^ 

AGAGTTAAAGGATAAATTTACTAGCGGTGACTTTACTGTAGTGATTAAGTACA^ 

AAGCTCTGTTTGGAATATCTAATTCCAAACCCGGTCAACAAAATAGTTATGTAGA 

GAGTTGGGGATGGAAGCGCGTGATACTTCTTCCAATAAAAATAACCTAGT^ 

GTACAAACAAGAGGCTGTGACTAACACTGTTGCAGTAGTAG 
10 ATGGTACAAAAGTAGTAGAAAAGAAAGTGGATAATTT C CTAAAC^TCAAGGATATTAAAGGTATTGATTACTATATG 

CTTGGGGGAGTGAAACGTGCAGGAAAAACGGCGTTTGGTTTTAACGGAACACTAGA 

TGCATTGGATGAAGAAACTGTTAAAAAGATGACAACAAACGCItjTTACTGGACATT^ 

CAACAGGTTCTAACTATTTCCGTATTCCAGTTCTCTATA C T Trr AGCAATGGTCGGGT 

CGTTACGGTGGAACTCATGATTTCTTGAATAAAATTAATATTGCTACAAGTTATA 
1 5 GACTAAACCAAAATTAACATTGGCATTCGATGATTTTGCGCCAGTA 

GTGACTTACAAATCAGCGGTGGTGCAACCTATATTGACTCTGTTATTGTTGAAAAAA^ 

TTTGCTGATCrrGATGCCTGCrrGGAGTAAGTTTTAGAGAAGCAACTAGAAA 

TAATTATTACCTTAAATTAAGGAAACAAGGTGATACTCATTACAA 

ACGATCGTACCAACAGACCAACTGAATTTTCAGTAGATAAAAATTTCGGTAT^ 
20 GTAGAGCGG 

MKKNI KQYVTLGTVWLSAFVANSVAAQETETS EVST? KLVQP VAPTTP I SETVQPTSDNSS ETVTVQPRTVETTVKDP 
SSTAEETPVLEKNNVTLTGGGEN\rrKEUa3KFTSGDFTWIKYNQSSETO 
ELGMEARDTSS^^(NNLVSRPASWGKYKQEAVTNTVAW 
25 LGGVKRAGCTATOFNGTLENI KFFNSAiJ)EETVKKMTT^ 

RYGGTHDFXNKINIATSYSDDNGKTWKPKLTIAFD^ 

FADVMP AGVS FREATRKDSGYKQ IDGNYYLKLRKQGDTD YNYTI RENGTVYDDRTNRPTEFS VDKNPGI KQNGNYLT 
VER 

ID214 - 4185 

ATGAAAAAATTTAGCCTATTACTAGCTATCCTACCATTTTTGGTTGCCTGTGAGAATCAAGCT 
TAGCGCTCAAAAGACAATCGTCCTTGCTACAGCTGGCGACG^ 
C AGGCTTTGAT AT CGAAGTTTT AAAGGCAGT AGATG AAAAACT CAGCG ACT ACGAGAT^ 
TGGGAGAGCATCTTCCCAGGACTTGATTCTGGTCACTATCAGGCTGCGGCCAA 
TGCTGAAAAATACCTTTACTCGCTTCCAATTTCCAACAATCCCCT 

CTT CT CTTGAC CAGAT CG CTGGT AAAACAACA CAAG AGG AT AC CGGAACTT CT AACG CTCAATT CATCAAT AACTGG 
AATCAGAAACACACTGATAATCCCGCTACAATTAATTTTTC^^ 
TAACGGAGAGTTTGATTTCCTAGTTTTTGACAAGGTATCCGTTCA 
CAGTCGTTGATTTACCTTCTGCAGATAGCCCCAGCAATTATATCATXTTCT 
CAATTTGATAAAGCGCTCAAAGAACTCTATCAAGACGGAACCCTOUAA 
TTACCTCCCAGATCAATCTCAGTTACAATAA 



MKKFSLUJULPFLVACENQATPKETSAQICTIVLATA^^ 
45 WESIFPGU5SGHYQAAANNI^YTKERAEKYLYSL?ISWPLVLVSN^ 

NQKHTDNPATINFSGEDIGKRII^LAhKjEFOFLVFDKVSVQKI IKDRGLOLSVVDL 
0 FDKALKELYQOGTLEICLSNTYLGGSYLPOQSQLQZ 

ID215 - 4211.1 

50 ATOAAA AAAAATAGTTTATATATCATATCCTCACT Cr ITIT L 'G C 1 I V 'IX j T C Tl A TTTGTCTATGCTACGGCGACGAA 

TTTTCAAAACACTACCAGTGCTAGGCAGGTAAAAACGGAAACCTATACTAATACAGT^ 

TACGCTATAATAGTCATAAGTATTTTATTAGCGGTTTTGCTT 

CTATCGCTAGCTAGTGAAATCCAAGAAAGTACACGTAAATTCAAGGTTAC^ 

AACGATTGAAGTTCCITTGAGCATTGAAGATTT 
55 TCAAGATTGGTAAGAAGGCTCAGAAGGATAAGGTAAAGATTCTACCAGAGATTGACCCTAGTC^ 

GTACAAATTGAAAATCTCATCGTGTCAGATAAAGAACTC^ 

TAAGATTATCGCTGTTTTGCCAACTAGCGAATOTATAACAGGT 

ACCGCAATGGTGTTGTCTTACCGGCAGTTATCACTCCGm 

CCAAGTTCAAGCACATCAAATTCAACrrACAAGCAGTTCATCGGAGACATCT^ 
60 AACGAATTAA 



30 
35 

40 
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MK3CNSLYIISSLFFACVLFVYATATNPQNSTSARQVKTET^^ 
LSLASEMQESTRKFKVTADLTDACTCTIEVPLSIEDLFN 
VQIENVMVSDKEVSITSDQETLDRIDKIIAVLPTSERITGNYSGS^ 
PSSSTSNSSTSSSSETSSSTKATSSKTNZ 

IP216 . 4127.3 

ATtTrTGATTGGCGAAGGGTATCGGACTTTCCCTGTCCTGATT^^ 

TG Crrrn sCAATTATGGCGATTATCATTGCCrTGGCA Al l ' l ' l IC 1TA TCCAAAAACACATTGCAAACCGCTACAGTT 

TCAGCATGAATCTOCTCCATCCAATTGAGCCTAAAAAAACTACAAAAGG 

TACGGAATTATCTTTATCTCTCrmTACCrCAAATCTACTT^ 

ATCTGTTAAAGGTTATTCTCCAAACAGTTACAAGGTAGCTTTCCATCCTATGGGATCTC 

GTATCCCTTTGATTGCCTTAGTTCTAGTTGTTCTATTTGCGACAT^ 

TTGTTTACAAACTTAATTGACAGCCTCAGTATtXrrACCTTATATTC 

TTCTTCCTTCAATACTGGTCTATTTGGAAGTGGA 1' I' IXJl'I'ATGATTACAGGGACIXrCri^'CATCTTGATTATGTCTC 

TATCTGCCAGAAGATTACCTTATACTATTCGCTCATCTGTTGCT 

GCTGCTGAAAGCTTAGGAAGTAGTCGTCTCAATACCTTTGCT 

TTCTGGAGCCATCTTATCTTGA 



MLIGEGYRTFPVLIYTQFISEVGGNSAFAIMAIIIAIJUFLIOKHIANRYSFSMNIXHPIEPKKTTKGKMAAiyA^ 
YGI I FISVLPQI YLI YTSFL2CTSGMVSVKGYSPNSYKVAFHRMGSAIFNTIRI PLIALVLWLFATFISYLAVRKRN 
LFTNLIDSL£WVPYIVPGTVT^IAFISSFNTGLF GSGFTWITGT^ 
AAESLGSSALNTFAKITTPMMLSGIISGAXLSZ 
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TabU4 
ID301 

ATGAATAAGAAAAAAATGATTTTAACAAGTCTACCCAGCGTCGCrAT 
5 TACTTTTGTAAGAGCAGAAGAATCTCCACAAGTTGTCGAAAAATC^ 

CAAAAGCTGATACTGCCAAGAAAGATTACGAAACGGCTAAAAAGAAAGCA 

GATCAGAAGAGAACTGAGGAGAAAGCTCGAAAAGAAGCAGAAGCATCT 

TCAAAATGCATATAAAGAGTACCGAGAAGTTCAAAATCAACGTAGT^ 
1H AATTAACAGAGGTCGACTCTAAAATAGAGAAGGCTAGGAAAGAGCAACAGGACTTCCAAAATAA^ 
1U AGAGCAGTTGTAGTTCCTGAACCAAATGCGTTGGCTGAGACTAAGAA 

ACTAGCTAAGAGAAAATATGATTATGCAACTCTAAAGGTAGCACT^ 

AAATTGAAAAACTTCAATATGAAATTTCTACTTTGGAACAAGA^ 

AAAAAACTTCTTGCTGGTGCGGATCCTGATGATGGCACAGAAGTTATAGAAGCT 

GCTAAACGCTAAACAAGCTGAGTTAGCAAAAAAACAAACAGAACTTGAAAAA C ' riVr 
1 5 GTAAGACTCAGGATGAATTAGATAAAGAAGCAGAAGAAGCTGAG^ 

GTTGCTGATTTAGAAAAAGAAATTAGTAACCTTGAAATATTACT^ 

TCTTCAAAATAAATTAGCTGCTAAAAAAGCTGAGTTAGCAAAAAAACAAAC^ 

TTGATCCTGAAGGTAAGACTCAGGATGAATTAGATAAAGAAGCAGAAGAAGCTGA 

CTTCAAAATAAAGTTGCTGATTTAGAAAAAGAAATTAGTAACCTTO 
20 TGATACTGCTGCTCTTCAAAATAAATTAGCTACTAAAAAAGCTGAATTGGAAAAAACT 

CTCTTAATGAGTTAGGCCCTGATGGAGATGAAGAAGAAACTCCAGCGCCGGCT 

GCACCAAAACCAGAGCAACCAGCTCCAGCTCCAAAACCAGAGCAAC 

TCCAGCTCCAAAACCAGAGCAACCAGCTCCAGCTCCAAAACCAGAGCAACCAGCT 

AGCCTACTCAACCAGAAAAACCAGCCACTCCAAAAACAGGCTGGAAACAAGAAAACGGTATG 
25 ACTGATGGTTCAATGGCAATAGGTTGGCTCCAAAACAACGGTTCATGGTACTAOT 

AACAGGTTGGGTGAAAGATGGAGATACCTGGTACTATCTTGAAGCATCAGGTGCTA 

AAGTATCAGATAAATGGTACTATGTCAACAGCAATGGCGCTATGGCGACAGGCTGGCTC 

TACTACCTCAACGCTAATXXjTGATATGGCGACAGGATGGCTCCAATACAACGGTTC 

TGGTGATATGGCGACAGGATGGGCTAAAGTCAACGGTTCATGGTACTACCTAAACGCTAACGCT 
30 GTTGGGCTAAAGTCAACGGTTCATPGTACTACCTAAACGCTAACGGTTCA^ 

GATACCTGGTACTATCTTGAAGCATCAGGTGCTATGAAAGCAAGCCAATGGT^ 

TGTCAATGGCTTAGGTGCCCTTGCAGTCAACACAACTGTAGATC^ 

AA 

35 MNKKKMILTSLASVAILGAGFVTSQPTFVRAEES 

DQKRTEEKARKEAEASQKLNDVALWQMAVKBYREVQNQRS KYXSDAEYQKKLTEVDS KI EKARKEQQDLQNKFNEV 

RAVWPEPNALAETXKKAEEAKAEEKVAKRKYDYATLKVAIJUCKEV 

KKUAGADPDDGTEVIEAKLKKGEAELNAKQAEL^ 

VADI£KEISNLEII*LGGADPEDDTAAI£NKLAAKXAEIJUCKOTELE 
40 LQNKVADLEKEISNl^ILLGGADSEDDTAAI^NKLATKKAELEKTQKE 

APKPEQPAPAPKPEQPAPAPKPEQPAPAPKPEQPAPAPKPEQPAKPEKPAEEPTQPEKPATPKTGWKQENQWYFYN 

TTCSMAIGWLQNNGSWYYLNAKGAMATGWVK1X2)TWY^ 

YYLNANGDMATGWLQYNGSWYYIifANGDMATGWAKVNGSW 

DTWYYLEASGAMKASQWFKVSDKWYYVNGLGAIJVVNTTVDGYKVNANGEWV2 

45 

IP302 

ATGTTTGCATCAAAAAGCGAAAGAAAAGTACATTATTCAATTCGTAAATT^ 

TGCCAGTCTTGTTATGGGAAGTGTGGTTCATGCGACAGAGAACGAGG 

GGGCAAATGAAAGTCAGGCAGAACAAGGAGAACAACCTAAAAAACTCGATT^ 
50 GTCGAGGAATATGTAAAAAAAATAGTGGGTGAGAGCTATGCAAAATCAACT 

TCTAGTTAACGAGTTGAACAACATTAAGAACGAGTATTTGAATAAAATAGTT^ 

AGATACTGATGATGGAGAGTCGATCAAAAGTAGATGAAGCTGTGTCTAAGTTTGAAAAGGA 

AGTTCAGACTCTTCCACTAAACCGGAAGCTTCAGATACAGCGAAGCCAAACAAG^ 

AGCAGAAGCTAAGAAGAAGGTTGAAG AAGCTGAGAAAAAAGC CAAGGATCAAA 
55 CAACCATTACITACAAAACGCTTGAACTTGAAATTGCTGAGTCCGAT^ 

GTAAAAGTGAAAGCTAACGAACCTCGAGACGAGCAAAAAATTAAGCAAGCAGAAGCGGAAGT^ 

TGAGGCTACAAGGTTAAAAAAAATCAAGACAGATCGTGAAGAAGCAGAAGAAGAAG 

AAGAGCAAGGTAAACCAAAGGGGCGGGCAAAACGAGGAGTTCCTGGAGAGCTAGCAACACCTC 

GATGCGAAGTCTTCAGATTCTAGCGTAGGTGAAGAAACTCTTCCAAGCCCATC 
60 AGAAGCTGAGAAGAAGGTTGAAGAAGCTAAGAAAAAAGCCGAGGATCAAAAAGAAGAAGATCGCCGTAACTACCCM 
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CCAATACTTACAAAACGCTTGAACTTGAAATTGCTG 

AAAGAGGAAGCTAAGGAACCTCGAAACGAGGAAAAACTTAAGCAAGCAAAAGCGGAAGTTGAGAGT 

GGCTACAAGGTTAGAAAAAATCAAGACAGATCGTAAAAAAGCAGAAGAAGAAGCTAAACGAAAAGCAGC^ 

ATAAAGTTAAAGAAAAACCAGCTGAACAACCACAACCAGCGCC^ 

AAACCAGAGAATCCAGCTGAACAACCAAAAGCAGAAAAACCAGCTGATCAAC^ 

ATCAGAAGAAGAATATAATCGCTTGACTCAACAGCAACCGCCAAAAACTGAAA^ 

AAACAGGCTGGAAACAAGAAAACGGTATGTGGTACITCTACAATACTC 

AACAATGGCTCATGGTACTACCTCAACAGCAATGGCGC7ATGGCGACAGGATOT 

CTATCTAAACGCTAATGGTTCAATGGCAACAGGATGGCTCCAAAACAATGGTT^ 

GTTCAATGGCGACAGGATGGCTCCAATA(^TCXjCTCATGCT 

TGGCTCCAATACAATGGCTCATCKjTACTACCTAAACGCTAATGGTGAT^ 

TACCrGGTACTATCTTGAAGCATCAGGTGCTATGAAAGCAATC 

TCAATGGCTCAGGTGCCCTTGCAGTCAACACAACTGTAGATGGCT 

TAA 

MFAS KSERKVHYS I RKFSVGVASVWASLVMGS VVHATENEGATQVPTSSNR^ 

VEEYVKXIVGESYAKSTKKRHTITVALVNELW^ 

SSDSSTKPEASDTAKFNKPTEPGEKVAEAKKKVEE^KX^^ 

VKVKANEP RDEQKI KQAEAEVESKQAEATRLKXI KTDREEAEEEAKRRADAKEQGKPKGRAJCRGVPGELATPDKKEN 
DAKSSDSSVGEETLPSPSLKPEKKVAEAEKKTOEAiOCKAEDQKEEDRRNYP 

KEEAKEPRNEEKVKQAKAEVESKKAEATRLEKI KTDRKXAEEEAKRKAAEEDKVKEKPAEQPQPAPAPKAEKPAPAP 

KPENPAEQPKAEKPADQQAEEDYARRSEEEYNRLTQQQPPK^ 

NNGSWYYLiJSNGAMATGWU2NNGSWYY^ 

WLQ YNGSW YYLNANGDMATGWVKIXJD WTf LEIAS GAM KASQW FKVSD KWYYWGSGAXAVNTTVDGYGVNANGEVTVN 
Z 

ID3Q3 

ATGGTAAAAAGACGTATAAGGAGAGGGACGAGAGAACCTGAAAAAGTTGTTGTT 

GTATCCTGTATCTGTTACATCTAACCAAGGAACAGATGTAGCAGTAGAACCAGCTAAAG 

ACTGGAAACAAGAAAATGGTATGTGGTATTTTTATAATACTGATGGTTCCATGGCA 

AGTTCATGGTACTACCTCAACAGCAACGGTTCTATGAAAGTCAATCAATC^ 

TXjTAAATACATCGGGTGAGTTAGCGGTCAATACAAGTATAGATGGCTATAG 

GTTAA 

MVKRRIRRGTREPEKVWPEQSSIPSYPVSVTSNQG7DVAVEPA1CAVAJTTDWKQEN 
SSWYYLNSNGSMKVNQWF^VGGKWYYVNTSGELAVNTS I DGYRVNDNGEWVRZ 

lain 

CTGAATACAAGTTTTGTTCATGCTGCTGATGGGATTCAATATCT 

AGAGTATGATGACGCTGACAATGGGGATATTATTGTAAAAGTAGCGACT 

CAAGTACGCGAATTCGTTATGAAAAAGATGAAAOVAAAGACCGTAGTGAAAA 

GGCTATGTAACTACGACAAGGACCTACGATGTTAATCCAGAGACTGGTTATGTT^ 

AAAAGAAGCCACGGATACAGTTATCAAAGTTCCAGCTAAAAGCAAGGTTGAAGAA G 

AATATT3AAGCAGACAATGACCTTTCTGCAGGACAGGAGCAAGAGATTACT 

ACGATAACTTATAATGTAGATGGAAAGAGTGGACAAGTAACTGAGAGTAC^ 

AAGAGTTGTTAAAAAAAGAACCAAGCCCCAA0ITLT1V1X.'C 

CAACTCTTGATAAAAGTCAAGAAGTAGAAGAAGTAGGAGAAATTGGTAAATTACT^ 

LNTSFVHAADGIQYVRDDTRDKEEGIEYDDADNGDI^ 

GYVTTTRTYDVNPETGYVTEQVTVDRKEATDTVIKVPAKSKVEEV 

TITYNVDGKSGQVTEST1^QKKDSQTRWKKRTKPQVLVQEIPIETEYLDGPTIJ5KS 



ID30S 

ATGAAGCTTTTGAAAAAAATGATGCAAATCGCACTAGCCACATTTT^ 
TGCAGATGATTCTGAAGGATGGCAGTTTGTCCAAGAAAATGGTAGAACCTACTACAAAAAG 

CCTACTGGAGAGTGATAGATGGGAAGTACTATTA T T ri^ TCCTTTATCCGGAGAGATGGTTGTCGGCTGGCAATAT 
ATACCTGCTCCACACAAGGGGGTTACGATTGGTCCTTCTCaVAGAATAGAGATTGCTCT^ 
TTTTGGTCAAGATGGTGTATTACAAGAATTTGTTGGCAAGCAAff 
AACATCATGGGGAAGAATATGATAGCCAAGCAGAGAAACGAGTCTATTATT^ 
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TTAAAAACTGCTTGGATTTATGAAGAGGGTC^ 
CAGATTCLACGGTTGGAGAGCTAGCACCTT 

CAGCTCCATGGTACTATCTAAATCCACCAACTGGCATTATGCAAACAGGTTGGCAATATCTAGCT 
TACCTCCATTCGTCAGGAGCTATCGCAACTGGCTGGTATAAGGAAGGCTCAACTTGGTACTATCTAC^ 
5 TGGTGATATGAGAACTGGCTGGCAAAACCTTGGGAACAAATGGTACTATCTCCGTT^ 
GTTGGTATCAGGAAAGTTCGACTTCXTrACTATCTAAATC 
AATGGTAACTGGTACTATGCCTATGATTCAGGTGCTTTAGCT^ 
TAA7GG7GAATGGGTTAAGTAA 

10 MKLLK3Q4MQ I AIATFFFGLIATNTVFADDSEGWQFVQENGR7YYKKGDLKETYWRV I DGKYVYFDPLSGQ'IWGWQY 

IPAPHKGVTIGPSPRIEIALRPDWFTFGQDGVIXJEFVGKQVLEA^ 
LKTGWI YEEGHVT¥TIX}KDGGroSRINRLTVGEL 
YLHSSGAHATGWYKEGSTWYYLDAENGDMRTGWQNLGNKVrY^ 
NGNWYYAYDSGALAVOTTVGGYYLNYNGEWVKZ 

15 

XD306 

TTGGCTGGTAGATATGGTTCTGCIXnTCAGTGTACAGAAGTG 
TACGGTTGTAGAAAAACCACTGAAAGATTTTAGAGCGTCTACGTCT^ 
AATGGTATTTCTATGAGTCTGGTGATGTGAAGACAGGTTGGGTGAAAACAGATGGT 
20 TTAGGTGTCATGCAGACrGGATTTGTAAAATTTTCTGGTAGC^ 

AGGCTGGGGAACAGATGGTAGCAGATGGTTCTACTTTGACGGCTCAGGAGCT 
ATGGCACTTGGTATTACCTTGACX5AAGGVGGTATCATGAAGACAGGT7GGTTT 
GCCTACGGTTCAGGAGCTTTGGCTGTGAGCACAACAACACCAGATGGTr 
AAACTAG 

25 

LAGRYGSAVQCTEVTASNLSTVKTKATVVEKPLKDF51&STS 

LGVMQTGFVKFSGSWYYLSNSGAMFTGWGTDGSRWFYFTCSGAMKTGWYK^ 

AYGSGALAVSTTTPDGYRVNGNGEWVNZ 

30 ID307 

ATGAAAATTTTGAAAAAAACTATGCAAGTTGGACTGACAGTA1 

TGCAGATGATTCTGAAGGATGGCAGTTTGTCCAAGAAAACGGAAG^ 

CCTACTGGCGAGTGATTGATGGTAAGTACTATTATTTTGATTCTCTATCTGGAGAGATC 

ATCCCGTTrCCATCTAAAGGTAGTACAATTGGTCCTTACCCAAATGGTATCW 
35 GTGGTACTACTTCGATAAAAATGGAGTGCTACAAGAG 1 Tlli ritXjl"ltjGAAAACATTAGAGATTAAAACTAAAGACA 

GTGTTGGAAGAAAGTACGGGGAAAAACGTCAAGATTCAGAAGATAAAGAAGAGAAGOT 

TTTAATCAAAATCATTCTTTAGAGACAGGTTGGCTTTATGATCACT 

TAATGGAGAAAACTACCTTGGTGGTGAAAGACGTGCGGGGTGGATAAACG^ 

CAACAACTGGTATTATGCAAACAGGTTGGCAATATCTAGGTAATAAGT^ 
40 GCCACTGGCTGGTATCAGGAAGGTACCACTTGGTATTATTT^ 

AAACCTTGGGAACAAATGGTACTATCTCCGTTCATCAGGAGCTATGGCAACTGGTTGG 

GGTACTACCTAAATGCAGGTAATGGAGACATGAAGACAGGTTGGTTCCAGGTCAATG 

AGCTCAGGTGCITTGGCJIOTGAATACGACCGTAGATGGCT 

45 MKILKKTMQVGLTVFFFGLLCTSTVFADDSEGWQFVQENGRTYYKKCT 

IPFPSKGSTIGPYPNGIRI^GFPKSEWYYFDKNGVLQEFVGWKTLEIKTKDSVGRKYGE 

FNQNHS LETGWL YDQSNWYYLAKTE INGENYI<GGERRAGW I MDDSTWYYLD PTTG IMQTGWQ YLGNKWYYLRSSGAM 

ATGWYQEGTTWYYLOHPNGDMKTGWQNLGNKWYYLRSSGAMATGWYQDGSTW 

SSGALAVNTTVDGYSVNYNGEWVRZ 

50 

ID308 

ATGAAAAAGAAATTAACTAGTTTAGCA CTlWA GGCG lTn 

AGAAAGTTCAGGAAATAAAATCCACTTTATCAATGTTCAAGAAGGTGGC^ 
55 C^CATTTTGCCATGGTGGATACAGGAGAAGATTATGATTTCCCA 

GGAATTGAAACGTCTTATAAGCATGTTCTAACAGACCGTGTCTTTCGTC^ 

TCATTTTATTTTGGTGACCCATACCCACAGTGATCATATTGGAA^ 

ACCGAGTCTATCTTAAGAAATATAGTGATAGTCCTrATTACTAAT? 

GATAAGGTTTTACAGACTGCTGCAGAAAAAGGTGTTTCAG 
60 cnTTGGGGACATGGATATTCAGCTCTATAATTATGAAAATGAAACTGATTCATCGGGTGAATT 



SUBSTITUTE SHEET (RULE 28) 



WO 00/06737 



98 



PCT/GB99/02451 



ATGA CAATTCCAATTCCTTGATTAGCGTGGTGAAAGTCAATGGCAAGAAAATTTACCTT^ 

GTTCATGGAGCAGAAGACAAGTATGGTCCTCTCATTGGAAAAGTTGATTTGATG 

CAACAAATCAAATACCAAGGATTTCATTAAAAATTTGAGTCCGAG^ 

GGAAAAATGGTGTTG ATAGTG AGTATGTTAATTGGCTCAAAGAACGAGGAATTGAG 
5 GACT ATGATGCAACAGTTTTTGATATTCGAAAAGACGGTTT^ 

TTTTCAAGCTGGTTGGCATAAGAGTGCATATGGGAACTGGTGGTATCAAGCGCCTGATTCT 

TCGGTTGGAATGAAATUSAAGGTGAATGGTATTACTT7AACCAAACGGGTATCTO 

TXKjAACAATCATTGGTTCTATTTGACAGACTCTGGTGCT^ 

TTATTTTAACAAAGAAAACCAGATGGAAATTGGTTGGATC 
1 0 GTTCTATGAAGACAGGATGGCTTCAATATATGGGGCAATCGTATTACTTTGCTrc 

TGGGTAAAAGATAAAGAAACCTGGTACTATATGGATTCTACTG G TGTCATGAACa 

TCAACATTATTATCTGGAAGATTCAGGAGCTATGAAGCAAGGCTGGCATAAAAA 

AGACAGACCGTTCACGAGCTGTGGGTTGGATCAAGGACAAGGA 

CTTGTGAACGGTAAGACACCAGAAGGTTATACTGTGGATTCAAGTGGTGCC^ 
1 5 ATCIT3CTACAATTAAAACTACAAGTCATTCAGAAATAAAAGAATCCAA 

AAGAAACGAGTCAACATGAAAGTGTTACAAATTTTTCAACTAGTCAA^ 

GAAACGAGTGTAAACAAATCGGAATCAGAACAGTAG 

MKIOaTStJ^VGAFLGI^WYGNVQAQESSCNKIHFIOT^ 
20 GIETSYKHVLTDRVFRRLXEU^QKU)FILVTHTHSDH^ 

DKVI^AAEKGVSVIQNITQGDAHFQFGDMDIQLYNYENETO^ 
VHGAEDKYGPLIGKVDLMKFNHHHDTNKSNTKDFiraTLSPSLIV^^ 

DYDATVFDIRKDGFVNI STS YKP I PS FQAGWHKSAYGNWWYQAPDSTGEYAVGWNE I EGEWYYFNQTGI LLQNQWKK 
WNNHWFYLTDSGASAKNWKKIAGIWYYFNKENQHEI^^ 
25 WVKDKETWYYMDSTGVMKTGEIEVAGQHYYLEDSGAMKQGWHKKANDW 
LVNGKTPEGYTVDSSGAWLVTDVSIEKSATIKTTSHSEI 
ETSVNKSESEQ2 

ID309 

30 ATGGAAATTAATGTGAGTAAATTAAGAACAGATTTGCCTCAAGTCGGCGTGCAACCATO 
CTCAACTGGGAATCCGCATTCAACCGTACAGAATGAAGCGGAT^ 

TCTCGCACATTGTTGGGAACGGTTGCATCATGCAGGTAGGACCTGTTGATAATGGTGCCT 

TGGAATGCTGAGACCTATGCAGCGGTTGAACTGATTGAAAGCCATTCA^ 

CCTTTATATCGAACTCTTACGCAATCTAGCAGATGAAGCAGGT^^ 
35 GAATTAAAACGCACGAGTATTGCACGAATAACCAACCAAACAACCACTCAGACC^ 

GCTAAATGGGGCATTAGC CGTGAGCAGTTT AAGCATGATATTGAGAACGGCTTGACGATTGAA 

GAATGACACTGGCTACTGGTACGTACATTCAGACGGCTCTTATCCAAAA« 

GGTACTACTTTGACAGTTCAGGCTATATGCTTGCAGACCGCTGGAGGAAGCA 

GACAACTCAGGCGAAATGGCTACAGGCTGGAAGAAAATCGCTGATAAGTGGTACTAT^ 
40 GAAGACAGGCTGGGTCAAGTACAAGGACACTTGGTACTACTTAGACGCTAAAGAAGGCGCCAT 

TTATCCAGTCAGCGGACGGAACAGGCTGGTACTACCTCA^ 

GTACAGCCAGATGGCTTGATTACAGTAAAATAA 

MEINVSFOiRTDLPQVGVQPYRQVHAHSTGNPHSTVQNEADYHWRKDPELGFFSH^ 
45 WNAETYAAVELIESHSTKEEFMTDYRLYIELLRNIJVDEAGLPKT^ 
AXWGISREQFIOffilENGLTIETGWQKNDTGYWYVHSDGSYPKDKFE 

DNSGEMATGWKKIADKWYYFNEEGAMKTGWVKYKDTWYY LDAKEGAMVSNAF IQS ADGTGWYYLKPDGTLADKPE FT 
VEPDGLITVK2 

50 ID310 

ATGGGCACAACAGGATTTACAATAATTGACTTAATTATCTTGAT^ 

CTATTTCTCTAAAAAAGAGATGAAAGGAAAAG A G T r CTrr AAAGGAGATGGTTCGGTTCCTTGGTATGTTACTTCGG 
TATCCATTTTTGCCACAATGCTCAGTCCGATTTCCTTCTTGGGACT 

TTATGGTTTGCTCAATTAGGGATGGT AGT AG CTATTCCACTGACAATTCGTTrrATCTTACCT ATCTTTG CACGGAT 
55 AGACATCGATACGGCATATGATTACTTGGATAAACGTTTTAATTCTAAA 

TTATTATTTATCAATTGGGACGTATGTCTATCATTATGTACCTCCCATCAGC^ 

GACATCAATATTTTGATTATTTTGATXjGGTGTAGTTGCAATTGTTT 

ATGGACAGACTTTATTCAAGGTGTGATTCTGATTAGTGGTGTCGTm 

AAGGTGGCTTTGGTGCAGTAGCAGAAACATTAGCAAACGGGAAATTCCTTGCTGCAAATG 
60 AA C ri XiCr ri'CAAACTCCAT CT rTr rA ATTGTGATGGGTTCAGGCTTTACAAT C ri tj 1 C 1 rCCTATGCTTCATCTCA 
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ACAT 1 1 bUrrCAACGl"lTTACTACAACACAAAATATTAAGAAACTTAATAAGATCTTC 
CACTTGCAACTGCAACAGTCTTTTACTTGATTGGTACACGCTTCn'ACCT 
GCAGCrAGCAATATCCCTCAAGACCAAATCTTTATGTACTTTATTGCAT^ 
GATCTTCGCAGCGATTTATGCAGCATCTCAATCAACTATTTC^ 
5 TGGATATTCAAGATGTCATTTCTAAAAATATGTCAGACAAT 
GTAGGTTTATTCTCAATTGGTGTTTCCATTGTCATGGCT^ 
TTTCATGGGACTTGTACTTGGTCTACTTGGTGGTGTATTTATTCTTGGA 1 " 

GTGCTTATGCAGCGCTGATTGTATCAACCATCGT CATGGT ATTTATTAAATACTTCCTTCCTCCAACAGCTGTTAGC 
TACTGGGCATATTCATTGATTTCAATCTCTGTATCAGTAGT^ 
1 0 AGTATCTGCACCTAAATATACAACGATTCATGATATTACAGAAATTAAAGCGGA^ 
AA 

MGTTGFTI IDLI I L I VYLLAVLVAG I YFS KKEMKGKEFFKCDGSVPWYVTSVS I FATMLS PIS FLGLAGSS YAGSWI 
LWFAQLGMWAI PLTIRFILPIFARIDIDTAYDYLDKRFNSKALRI ISALLFI IYQLGRMSI IMYLFSAGLSVt/TGI 
15 DINILIIL>!GWAIVYSriXX3LKSVLWTDFIQGVILI^ 

MLLSNS I FLI VMGSGFTILSS YASSQDLVQRFTTTQNI KKLNKMLFTNGVLSLATATVFYLI GTGL YVFYQVQNADS 
AASNI PQDQI FMYFIAYQLPVGITGLILAAI YAASQSTISTGLNSVATSVTTLDIQDVISKNMSDMRRTKIAQFVSLA 
VGLFS IGVS I VMAHSDI KSAYEWFNS FMGLVLGLLGGVFI LGFVSKKANKQGAYAALIVSTIVMVFI KYFLPPTAVS 
YWAYS L I S I SVSWSGY I VSVLTGNKVS AP KYTT I HDITE I KADS S WEVRHZ 

20 

mn 

ATGAAAATTAATAAAAAATATCrrAGCAGGTTCAGTGGCAGTCCTTGCCCTAA G l XJ 
TCACCAAGCTGGTCAGCATAAGAAAGAGTCTAATCGAGTTGC^ 

AAAACTTGACACCAGATGAAGTCAGTAAGAGGGAGGGGATCAACGCCGAACAAATCGTCA 
25 GGTTATGTGACCTCTCATGGAGACCATTATCATTACTAT^ 

GCTCCTCATGAAAGATCCGAATTATCAGTTGAAGGATTCAGACAT^ 

AGGTAGACGGAAAATACTATXTITTACCTTAAGGATGCAGCTCATGCGGATAA 

CGTCAGAAGCAGGAACGCAGTCATAATCACGGGTCAGGAGCTAACG^ 

ACGCTATACAACGGATGATGGGTATATCTTCAATGCATC^ 
30 CTCACGGCGACCATTACCATTACArrCCTAAGAATGAGTTATCAGCTAGCGA 

AATGGGAAGCAGGGATCTCCTCCTTCTTCAAGTTCTAGTTATAATG^ 

CCACAATCTGACTGTCACTCCAACTTATCATCAAAATCAAGGGGAAAA^ 

CTAAACCCTTATCAGAACGCCATGTGGAATCTGATGGCCTTATm 

AGAGGTGTAGCTGTCCCTCATGGTAACCATTACCACTTTATCCCTTATGAA 
35 TGCTCGTATTATTCCCCTTCGTTATCGTTCAAACCATTGGGTACCAGATTCAAGACCA 

CGACTCCGGAACCTAGTCCAAGTCCGCAACCTGCACCAAATCCTG 

TTGGTCAAAGAAGCTGTTCGAAAAGTAGGCGATGGTTATGTCTTTGAG 

CAAGGATCTTTCAGCAGAAACAGCAGCAGGGATTGATAGCAAACTGGCCAAGC^ 

GAGCTAAGAAAACTGACCTCCCATCTAGTGATCGAGAATTTTACAATAAGGCT 
40 CAAGATTTACTTGATAATAAAGGTCGACAAGTTGATTTTGAGGCTTT^ 

CCCAAGTGATAAAGTCAAGTTAGTGGATGATATTCTTGCCTTCTTAGCTCCGAT^ 

AACCAAATGCGCAAATTACCTACACTGATGATGAGATTCAAGTAGCCAAGTTGGO 

GGTTATATCTTTGATCCTCGTGATATAACCAGTGATGAGGGGGATGCC7ATCT 

CTGGATTAAAAAAGATAGTTTGTCTGAAGCTGAGAGAGCGGCAGCCCAGGCTTATGCrAA 
45 CTCCTTCGACAGACCATCAGGATTCAGGAAATACTGAGGCAAAAGGAGCAGA 

GCTAAGAAGGTGCCACTTGATCGTATGCCTTACAATOT 

ACCTCATTATGACCATTACCATAACATCAAATTTGAGTGGTTTW 

CTCTTGAGGATCTTTTGGCGACTGTCAAGTACTATGTCGAACATCCAAACGAACG 

GGTAACGCTAGCGACCATGTTCAAAGAAACAAAAATGGTCAAGCTGAT 
50 GAAACCTCAGACAGAAAAACCTGAGGAAGAAACCCCTCGAGAAGAGAAACCGCAAAGCC^ 

AACCAACAGAGGAACCAGAAGAATCACCAGAGGAATCAGAAGAACCTCAGGTCGAGAC^ 

CTGAGAGAGGCTGAAGATTTACTTGGAAAAATCCAGGATCCAATTATCAAGTC 

ATTAAAAAATAATTTACTATTTGGCACCCAGGACAACAATACTATTATGGCAGAAGCTGAAAAACT 

TAAAGGAGAGTAAGTAA 

55 

MKINKKYIAGSVAVLALSVCSYELGRHQAGQDKKESNRVAYIDGDQA^ 
GYVTSHGDHYHYYNGKVPYDAIISEELI^KDPtTVQLKDSDIVNEIKGGYVI 

RQKQERSHNHGSGANDHAVAAARAQGRYTTDDGYI FMASDI X EDTGDAYIVPHGDHYHYI PKNELSASELAAAEAYW 
NGKQGSRPSSSSSYNANPAQPRLSENHNLTVTPTYHQW^ENISS 
60 ROVAVPHGNHYHFIPYEQMSELEKRIARIIPLRYRSNHWVPDSRPEQPSPQSTPEPSPSPQPAPNPQPAPSNPIDEK 
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LVKRkVRKVGDGYVF^ENCTSRYIPAJCDLSAET 
QDLLDNKGRQVDFEAUDKIXERl^VPSDKVKLVD 

GYI FDPRD I TSDEGDAYVTPHMTHSHWI KXDS LS EAERAAAQAYAKEKGLTP PSTDHQDSGNTEAKGAEAI YNRVKA 
AKKVPLDRMPYNLQYTVEVKNGSLI I PHYDHYHNI KFEWroEGLYEAPKGYTI^rXATVKYYVEHPNERPHSDNGF 
5 GNASDHVORNKNGQADTNQTEKPSE£KPOTC^ 

LRKAEDLLGKX QDP 1 1 KSHAKETLTGLKNNLL FGTQDNNTIMAEAEKLLALUCES K2 

IP312 

ATGGAGGGATTGGTTAGAGTGCATrTATTGCCrGTATTTGGCGA 
10 TCAACAGCAAGTAAACAAATGGGCTGACAAGGCAAATAAAGGCGAAAAAGGGG 
ATAACATGAATAAGCGTATTTTGAAATATGGCGTAGCTATCCAG 
GTTCCACGCAAACAGCAAAAAGAAAAGGCTGCTGTCAAATAOT 
TTTAGATGCTCTGGATC^TCAAATTATGAG^ 
GCCGTATTAGTQAGGC1V1Xj1 X .TC1T^ 

1 5 ctaaaccgctatcaggaaataaactcacctaaatcaagcgctggtt^ 

acttttactgaaacaatacaaaaaccgtcaacaaattcagtcttggaaatt^ 

ctgtatttacggagaaatatgcttatgcttgtaacttacgc^^ 

actaacgtatc^tttcatggtttccgccatacacatactactatgatgct 

tcagtatagattaggccactctaatttaatgatcac^^ 
20 aagccgtctcaaattatgaaacagctatcaacaatttataa 

meglvrvhllpvfgd ykls klttp i lqqqvkkwadkankgekgafanysllhkmnkri lkygvai qvi qynpandvi 

VPRXQQKEKAAVfCYLDNKSI^nj)YU}AU^SNYEN^ 

LNRYQEINS PKSSAGYRD I P I DKATLLLLKQYKNRQQ IQSWKLGRSETWFSVrrEKYAYAOT*RKRLMKHFDAAGV 
25 TNVSFHGFTWTHTTMMLYAQVSPKDVQYR^^ 
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CLAIMS: 

1. A Streptococcus pneumoniae protein or polypeptide having a sequence 
selected from those shown in table 2. 

5 

2. A Streptococcus pneumoniae protein or polypeptide having a sequence 
selected from those shown in table 4. 

3. A protein or polypeptide as claimed in claim 1 or claim 2 provided in 
10 substantially pure form. 

4. A protein or polypeptide which is substantially identical to one defined in any 
one of claims 1 to 3. 



15 5. A homologue or derivative of a protein or polypeptide as defined in any one of 
claims 1 to 4. 

6. An antigenic and/or immunogenic fragment of a protein or polypeptide as 
defined 

20 in Tables 2-4. 

7. A nucleic acid molecule comprising or consisting of a sequence which is: 



(0 



any of the DNA sequences set out in Table 1 or their RNA equivalents; 



25 



(ii) 



a sequence which is complementary to any of the sequences of (i); 



(iii) 



a sequence which codes for the same protein or polypeptide, as those 
sequences of (i) or (ii); 
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(iv) a sequence which is substantially identical with any of those of (i), (ii) 
and (iii); 

(v) a sequence which codes for a homologue, derivative or fragment of a 
protein as defined in Table 1. 

8. A nucleic acid molecule comprising or consisting of a sequence which is: 

(i) any of the DNA sequences set out in Table 4 or their RNA equivalents; 

(ii) a sequence which is complementary to any of the sequences of (i); 

(iii) a sequence which codes for the same protein or polypeptide, as those 
sequences of (i) or (ii); 

(iv) a sequence which is substantially identical with any of those of (i), (ii) 
ami (iii); 

(v) a sequence which codes for a homologue, derivative or fragment of a 
protein as defined in Table 4. 

9. The use of a protein or polypeptide having a sequence selected from those 
shown in Tables 2-4, or homologues, derivatives and/or fragments thereof, as an 
immunogen and/or antigen. 

10. An immunogenic and/or antigenic composition comprising one or more 
proteins or polypeptides selected from those whose sequences are shown in Tables 2- 
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4, or homologues or derivatives thereof, and/or fragments of any of these. 

11. An immunogenic and/or antigenic composition as claimed in claim 10 which 
is a vaccine or is for use in a diagnostic assay. 

5 

12. A vaccine as claimed in claim 1 1 which comprises one or more additional 
components selected from excipients, diluents, adjuvants or the like. 

13. A vaccine composition comprising one or more nucleic acid sequences as 
10 defined in Tables 1, 3 or 4. 

14. A method for the detection/diagnosis of S.pneumoniae which comprises the 
step of bringing into contact a sample to be tested with at least one protein or 
polypeptide as defined in Tables 2-4, or homologue, derivative or fragment thereof. 

15 

15. An antibody capable of binding to a protein or polypeptide as defined in 
Tables 2-4, or for a homologue, derivative or fragment thereof. 

16. An antibody as defined in claim 15 which is a monoclonal antibody. 

20 

17. A method for the detection/diagnosis of S.pneumoniae which comprises the step 
of bringing into contact a sample to be tested and at least one antibody as define din 
claim 15 or claim 16. 

25 18. A method for the detection/diagnosis of S.pneumoniae which comprises the 
step of bringing into contact a sample to be tested with at least one nucleic acid 
sequence as defined in claim 7 or claim 8. 
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19. A method of determining whether a protein or polypeptide as defined in 
Tables 2-4 represents a potential anti-microbial target which comprises inactivating 
said protein or polypeptide and determining whether S.pneumoniae is still viable in 
vitro or in viva. 

20. The use of an agent capable of antagonising, inhibiting or otherwise 
interfering with the function or expression of a protein or polypeptide as defined in 
Tables 2-4 in the manufacture of a medicament for use in the treatment or 
prophylaxis of S.pneumoniae infection 
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